(57) Abstract

The invention describes a method for isolating one or more genetic elements encoding a gene product having a desired activity,
comprising the steps of: (a) compartmentalising genetic elements into microcapsules; (b) expressing the genetic elements to produce their
respective gene products within the microcapsules; (c) sorting the genetic elements which produce the gene product having the desired
activity using a change in the optical properties of the genetic elements. The invention enables the in vitro evolution of nucleic acids and
proteins by repeated mutagenesis and iterative applications of the method of the invention. 
OPTICAL SORTING METHOD



The present invention relates to methods for use in in vitro evolution of molecular
libraries. In particular, the present invention relates to methods of selecting nucleic acids
encoding gene products in which the nucleic acid and the activity of the encoded gene
product are linked by compartmentation.

Evolution requires the generation of genetic diversity (diversity in nucleic acid) followed
by the selection of those nucleic acids which result in beneficial characteristics. Because
the nucleic acid and the activity of the encoded gene product of an organism are
physically linked (the nucleic acids being confined within the cells which they encode)
multiple rounds of mutation and selection can result in the progressive survival of
organisms with increasing fitness. Systems for rapid evolution of nucleic acids or
proteins in vitro advantageously mimic this process at the molecular level in that the
nucleic acid and the activity of the encoded gene product are linked and the activity of the
gene product is selectable.

Recent advances in molecular biology have allowed some molecules to be co-selected
according to their properties along with the nucleic acids that encode them. The selected
nucleic acids can subsequently be cloned for further analysis or use, or subjected to
additional rounds of mutation and selection.

Common to these methods is the establishment of large libraries of nucleic acids.
Molecules having the desired characteristics (activity) can be isolated through selection
regimes that select for the desired activity of the encoded gene product, such as a desired
biochemical or biological activity, for example binding activity.

Phage display technology has been highly successful in providing a vehicle that allows for
the selection of a displayed protein by providing the essential link between nucleic acid
and the activity of the encoded gene product (Smith, 1985; Bass et al., 1990; McCafferty
et al., 1990; for review see Clackson and Wells, 1994). Filamentous phage particles act as
genetic display packages with proteins on the outside and the genetic elements which
encode them on the inside. The tight linkage between nucleic acid and the activity of the
encoded gene product is a result of the assembly of the phage within bacteria. As
individual bacteria are rarely multiply infected, in most cases all the phage produced from
an individual bacterium will carry the same genetic element and display the same protein.

However, phage display relies upon the creation of nucleic acid libraries in vivo in
bacteria. Thus, the practical limitation on library size allowed by phage display
technology is of the order of 10^7 to 10^11, even taking advantage of λ phage vectors with
excisable filamentous phage replicons. The technique has mainly been applied to
selection of molecules with binding activity. A small number of proteins with catalytic
activity have also been isolated using this technique; however, selection was not directly
for the desired catalytic activity, but either for binding to a transition-state analogue
(Widersten and Mannervik, 1995) or reaction with a suicide inhibitor (Soumillion et al.,
1994; Janda et al., 1997). More recently there have been some examples of enzymes
selected using phage-display by product formation (Atwell & Wells, 1999; Demartis et
al., 1999; Jestin et al., 1999; Pederson et al., 1998), but in all these cases selection was
not for multiple turnover.

Specific peptide ligands have been selected for binding to receptors by affinity selection
using large libraries of peptides linked to the C terminus of the lac repressor LacI (Cull et
al., 1992). When expressed in E. coli the repressor protein physically links the ligand to
the encoding plasmid by binding to a lac operator sequence on the plasmid.

An entirely in vitro polysome display system has also been reported (Mattheakis et al.,
1994; Hanes and Pluckthun, 1997) in which nascent peptides are physically attached via
the ribosome to the RNA which encodes them. An alternative, entirely in vitro system for
linking genotype to phenotype by making RNA-peptide fusions (Roberts and Szostak,
1997; Nemoto et al., 1997) has also been described.

However, the scope of the above systems is limited to the selection of proteins and
furthermore does not allow direct selection for activities other than binding, for example
catalytic or regulatory activity.
In vitro RNA selection and evolution (Ellington and Szostak, 1990), sometimes referred
to as SELEX (systematic evolution of ligands by exponential enrichment) (Tuerk and
Gold, 1990) allows for selection for both binding and chemical activity, but only for
nucleic acids. When selection is for binding, a pool of nucleic acids is incubated with
immobilised substrate. Non-binders are washed away, then the binders are released,
amplified and the whole process is repeated in iterative steps to enrich for better binding
sequences. This method can also be adapted to allow isolation of catalytic RNA and
DNA (Green and Szostak, 1992; for reviews see Chapman and Szostak, 1994; Joyce,
1994; Gold et al., 1995; Moore, 1995).

However, selection for "catalytic" or binding activity using SELEX is only possible
because the same molecule performs the dual role of carrying the genetic information and
being the catalyst or binding molecule (aptamer). When selection is for "auto-catalysis"
the same molecule must also perform the third role of being a substrate. Since the genetic
element must play the role of both the substrate and the catalyst, selection is only possible
for single turnover events. Because the "catalyst" is in this process itself modified, it is by
definition not a true catalyst. Additionally, proteins may not be selected using the SELEX
procedure. The range of catalysts, substrates and reactions which can be selected is
therefore severely limited.

Those of the above methods that allow for iterative rounds of mutation and selection are
mimicking in vitro mechanisms usually ascribed to the process of evolution: iterative
variation, progressive selection for a desired activity and replication. However, none
of the methods so far developed have provided molecules of comparable diversity and
functional efficacy to those that are found naturally. Additionally, there are no man-made
"evolution" systems which can evolve both nucleic acids and proteins to effect the full
range of biochemical and biological activities (for example, binding, catalytic and
regulatory activities) and that can combine several processes leading to a desired product
or activity.
There is thus a great need for an in vitro system that overcomes the limitations discussed
above.

In Tawfik and Griffiths (1998), and in International patent application PCT/GB98/01889,
we describe a system for in vitro evolution that overcomes many of the limitations
described above by using compartmentalisation in microcapsules to link genotype and
phenotype at the molecular level.

In Tawfik and Griffiths (1998), and in several embodiments of International patent
application PCT/GB98/01889, the desired activity of a gene product results in a
modification of the genetic element which encoded it (and is present in the same
microcapsule). The modified genetic element can then be selected in a subsequent step.

Here we describe a further invention in which the modification of the genetic element
causes a change in the optical properties of the element itself, and which has many
advantages over the methods described previously.



BRIEF DESCRIPTION OF THE INVENTION 

According to a first aspect of the present invention, there is provided a method for
isolating one or more genetic elements encoding a gene product having a desired activity,
the expression of which may result, directly or indirectly, in the modification of an optical
property of a genetic element encoding the gene product, comprising the steps of:
(a) compartmentalising genetic elements into microcapsules;
(b) expressing the genetic elements to produce their respective gene products within
the microcapsules;
(c) sorting the genetic elements which produce the gene product(s) having the desired
activity according to the changed optical properties of the genetic elements.



The microcapsules according to the present invention compartmentalise genetic elements
and gene products such that they remain physically linked together.
As used herein, a genetic element is a molecule or molecular construct comprising a
nucleic acid. The genetic elements of the present invention may comprise any nucleic
acid (for example, DNA, RNA or any analogue, natural or artificial, thereof). The nucleic
acid component of the genetic element may moreover be linked, covalently or
non-covalently, to one or more molecules or structures, including proteins, chemical entities
and groups, and solid-phase supports such as beads (including nonmagnetic, magnetic and
paramagnetic beads), and the like. In the method of the invention, these structures or
molecules can be designed to assist in the sorting and/or isolation of the genetic element
encoding a gene product with the desired activity.

Expression, as used herein, is used in its broadest meaning, to signify that a nucleic acid
contained in the genetic element is converted into its gene product. Thus, where the
nucleic acid is DNA, expression refers to the transcription of the DNA into RNA; where
this RNA codes for protein, expression may also refer to the translation of the RNA into
protein. Where the nucleic acid is RNA, expression may refer to the replication of this
RNA into further RNA copies, the reverse transcription of the RNA into DNA and
optionally the transcription of this DNA into further RNA molecule(s), as well as
optionally the translation of any of the RNA species produced into protein. Preferably,
therefore, expression is performed by one or more processes selected from the group
consisting of transcription, reverse transcription, replication and translation.

Expression of the genetic element may thus be directed into either DNA, RNA or protein,
or a nucleic acid or protein containing unnatural bases or amino acids (the gene product)
within the microcapsule of the invention, so that the gene product is confined within the
same microcapsule as the genetic element.

The genetic element and the gene product thereby encoded are linked by confining each
genetic element and the respective gene product encoded by the genetic element within
the same microcapsule. In this way the gene product in one microcapsule cannot cause a
change in any other microcapsules. In addition, further linking means may be employed
to link gene products to the genetic elements encoding them, as set forth below.
The term "microcapsule" is used herein in accordance with the meaning normally
assigned thereto in the art and further described hereinbelow. In essence, however, a
microcapsule is an artificial compartment whose delimiting borders restrict the exchange
of the components of the molecular mechanisms described herein which allow the sorting
of the genetic elements according to the function of the gene products which they encode.

Preferably, the microcapsules used in the method of the present invention will be capable
of being produced in very large numbers, and thereby of compartmentalising a library of
genetic elements which encodes a repertoire of gene products.

As used herein, a change in optical properties of the genetic elements refers to any change
in absorption or emission of electromagnetic radiation, including changes in absorbance,
luminescence, phosphorescence or fluorescence. All such properties are included in the
term "optical". Genetic elements can be sorted, for example, by luminescence,
fluorescence or phosphorescence activated sorting. In a preferred embodiment, flow
cytometry is employed to sort genetic elements; for example, light scattering (Kerker,
1983) and fluorescence polarisation (Rolland et al., 1985) can be used to trigger flow
sorting. In a highly preferred embodiment genetic elements are sorted using a fluorescence
activated cell sorter (FACS) (Norman, 1980; Mackenzie and Pinder, 1986).
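
The sorting decision described above can be illustrated with a minimal sketch. It assumes each detected event carries one light scattering value and one fluorescence value, that a scatter gate isolates single beads, and that a fluorescence threshold defines the sorted population. The names `Event` and `sort_events`, and all numerical values, are hypothetical illustrations and are not taken from any instrument software.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    """One detected particle (hypothetical record, arbitrary units)."""
    forward_scatter: float
    fluorescence: float


def sort_events(events, scatter_gate, fluorescence_threshold):
    """Keep events inside the scatter gate whose fluorescence meets the threshold."""
    low, high = scatter_gate
    return [
        e for e in events
        if low <= e.forward_scatter <= high
        and e.fluorescence >= fluorescence_threshold
    ]


# Illustrative events: a dim single bead, two bright single beads,
# and one bright aggregate that falls outside the scatter gate.
events = [
    Event(forward_scatter=100.0, fluorescence=5.0),
    Event(forward_scatter=105.0, fluorescence=250.0),
    Event(forward_scatter=210.0, fluorescence=400.0),
    Event(forward_scatter=98.0, fluorescence=300.0),
]

sorted_events = sort_events(events, scatter_gate=(90.0, 120.0),
                            fluorescence_threshold=200.0)
print([e.fluorescence for e in sorted_events])
```

Gating on light scatter before applying the fluorescence threshold excludes aggregated beads, which would otherwise carry genetic elements from more than one compartment into the sorted fraction.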


Changes in optical properties may be direct or indirect. Thus, the change may result in
the alteration of an optical property in the genetic element itself, or may lead indirectly to
such a change. For example, modification of a genetic element may alter its ability to
bind an optically active ligand, thus indirectly altering its optical properties.

Alternatively, imaging techniques can be used to screen thin films of genetic elements to
allow enrichment for a genetic element with desirable properties, for example by physical
isolation of the region where a genetic element with desirable properties is situated, or
ablation of non-desired genetic elements. The genetic elements can be detected by
luminescence, phosphorescence or fluorescence.
According to a preferred embodiment of the first aspect of the present invention, the
sorting of genetic elements may be performed by one of essentially two techniques.

(I) In a first embodiment, the genetic elements are sorted following pooling of the
microcapsules into one or more common compartments. In this embodiment, a gene
product having the desired activity modifies the genetic element which encoded it (and
which resides in the same microcapsule) so as to make it selectable as a result of its
modified optical properties in a subsequent step. The reactions are stopped and the
microcapsules are then broken so that all the contents of the individual microcapsules are
pooled. The modification of the genetic element in the microcapsule may result directly
in the modification of the optical properties of the genetic element. Alternatively, the
modification may allow the genetic elements to be further modified outside the
microcapsules so as to induce a change in their optical properties. Selection for the
genetic elements with modified optical properties enables enrichment of the genetic
elements encoding the gene product(s) having the desired activity. Accordingly, the
invention provides a method according to the first aspect of the invention, wherein in step
(b) the gene product having the desired activity modifies the genetic element encoding it
to enable the isolation of the genetic element as a result of a change in the optical
properties of the genetic element. It is to be understood, of course, that modification may
be direct, in that it is caused by the direct action of the gene product on the genetic
element, or indirect, in which a series of reactions, one or more of which involve the gene
product having the desired activity, leads to modification of the genetic element.

(II) In a second embodiment, the genetic elements may be sorted by a multi-step
procedure, which involves at least two steps, for example, in order to allow the exposure
of the genetic elements to conditions which permit at least two separate reactions to occur.
As will be apparent to persons skilled in the art, the first microencapsulation step of the
invention advantageously results in conditions which permit the expression of the genetic
elements - be it transcription, transcription and/or translation, replication or the like.
Under these conditions, it may not be possible to select for a particular gene product
activity, for example because the gene product may not be active under these conditions,
or because the expression system contains an interfering activity. The invention therefore
provides a method according to the first aspect of the present invention, wherein step (b)
comprises expressing the genetic elements to produce their respective gene products
within the microcapsules, linking the gene products to the genetic elements encoding
them and isolating the complexes thereby formed. This allows for the genetic elements
and their associated gene products to be isolated from the capsules before sorting
according to gene product activity takes place. In a preferred embodiment, the complexes
are subjected to a further compartmentalisation step prior to isolating the genetic elements
encoding a gene product having the desired activity. This further compartmentalisation
step, which advantageously takes place in microcapsules, permits the performance of
further reactions, under different conditions, in an environment where the genetic
elements and their respective gene products are physically linked. Eventual sorting of
genetic elements may be performed according to embodiment (I) above.

The "secondary encapsulation" may also be performed with genetic elements linked to
gene products by other means, such as by phage display, polysome display, RNA-peptide
fusion or lac repressor peptide fusion.

The selected genetic element(s) may also be subjected to subsequent, optionally more
stringent rounds of sorting in iteratively repeated steps, reapplying the method of the
invention either in its entirety or in selected steps only. By tailoring the conditions
appropriately, genetic elements encoding gene products having a better optimised activity
may be isolated after each round of selection.

Additionally, the genetic elements isolated after a first round of sorting may be subjected
to mutagenesis before repeating the sorting by iterative repetition of the steps of the
method of the invention as set out above. After each round of mutagenesis, some genetic
elements will have been modified in such a way that the activity of the gene products is
enhanced.
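
The iterative scheme of sorting followed by mutagenesis can be illustrated with a toy numerical sketch. It is not a model of any experiment in this specification: it assumes that each genetic element has a single scalar "activity", that the optical signal equals that activity plus measurement noise, that sorting keeps a fixed top fraction of signals, and that mutagenesis perturbs activity by a small random amount. All function names and parameter values are hypothetical.

```python
import random
import statistics


def sort_by_signal(pool, keep_fraction, rng, noise_sd=0.05):
    """Keep the elements with the highest noisy optical signal."""
    signals = [(activity + rng.gauss(0.0, noise_sd), activity) for activity in pool]
    signals.sort(reverse=True)
    n_keep = max(1, int(len(pool) * keep_fraction))
    return [activity for _, activity in signals[:n_keep]]


def mutagenise(selected, pool_size, rng, mutation_sd=0.1):
    """Rebuild a full pool from the selected elements, perturbing each copy."""
    return [
        max(0.0, rng.choice(selected) + rng.gauss(0.0, mutation_sd))
        for _ in range(pool_size)
    ]


def evolve(rounds=3, pool_size=1000, keep_fraction=0.1, seed=0):
    """Return the mean activity of the pool before and after each round."""
    rng = random.Random(seed)
    pool = [rng.random() for _ in range(pool_size)]  # assumed starting repertoire
    history = [statistics.mean(pool)]
    for _ in range(rounds):
        selected = sort_by_signal(pool, keep_fraction, rng)
        pool = mutagenise(selected, pool_size, rng)
        history.append(statistics.mean(pool))
    return history


history = evolve()
print([round(value, 3) for value in history])
```

In this sketch a smaller `keep_fraction` corresponds to a more stringent round of sorting, and the mean activity of the pool is expected to rise from round to round because mutagenesis acts only on elements that survived the previous sort.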






Moreover, the selected genetic elements can be cloned into an expression vector to allow 
further characterisation of the genetic elements and their products. 
In a second aspect, the invention provides a product when selected according to the first
aspect of the invention. As used in this context, a "product" may refer to a gene product, 
selectable according to the invention, or the genetic element (or genetic information 
comprised therein). 


In a third aspect, the invention provides a method for preparing a gene product, the
expression of which may result, directly or indirectly, in the modification of the optical
properties of a genetic element encoding it, comprising the steps of:
(a) preparing a genetic element encoding the gene product;
(b) compartmentalising genetic elements into microcapsules;
(c) expressing the genetic elements to produce their respective gene products within
the microcapsules;
(d) sorting the genetic elements which produce the gene product(s) having the desired
activity using the changed optical properties of the genetic elements; and
(e) expressing the gene product having the desired activity.

In accordance with the third aspect, step (a) preferably comprises preparing a repertoire of
genetic elements, wherein each genetic element encodes a potentially differing gene
product. Repertoires may be generated by conventional techniques, such as those
employed for the generation of libraries intended for selection by methods such as phage
display. Gene products having the desired activity may be selected from the repertoire,
according to the present invention, according to their ability to modify the optical
properties of the genetic elements in a manner which differs from that of other gene
products. For example, desired gene products may modify the optical properties to a
greater extent than other gene products, or to a lesser extent, including not at all.

In a fourth aspect, the invention provides a method for screening a compound or
compounds capable of modulating the activity of a gene product, the expression of which
may result, directly or indirectly, in the modification of the optical properties of a genetic
element encoding it, comprising the steps of:
(a) preparing a repertoire of genetic elements encoding gene product;
(b) compartmentalising genetic elements into microcapsules;
(c) expressing the genetic elements to produce their respective gene products within the
microcapsules;
(d) sorting the genetic elements which produce the gene product(s) having the desired
activity using the changed optical properties of the genetic elements; and
(e) contacting a gene product having the desired activity with the compound or
compounds and monitoring the modulation of an activity of the gene product by the
compound or compounds.

Advantageously, the method further comprises the step of:
(f) identifying the compound or compounds capable of modulating the activity of the
gene product and synthesising said compound or compounds.

This selection system can be configured to select for RNA, DNA or protein molecules 
with catalytic, regulatory or binding activity. 


Brief Description of the Figures 

Figure 1. Dihydrofolate reductase can be expressed from genes in vitro translated in
solution and genes attached to paramagnetic beads with identical efficiency. The DHFR
activity resulting from in vitro translation of folA genes in solution or folA genes attached
to paramagnetic microbeads is determined by monitoring the oxidation of NADPH to
NADP spectrophotometrically at 340 nm and activity is calculated by initial velocities
under S0 >> Km conditions (vmax). (♦), translated from genes in solution; (■), translated
from genes attached to microbeads.

Figure 2. Epifluorescence microscopy of water-in-oil emulsions demonstrating that GFP
can be translated in vitro from genes attached to single microbeads encapsulated in the
aqueous compartments of the emulsions and the translated gene-product bound back to the
microbeads, making them fluorescent.

WO 00/40712 PCT/GB00/00030

Figure 3. Flow cytometric analysis of GFP expression in microcapsules and in situ
binding to the genetic element (microbeads). A: The light scattering characteristics of the
beads before reaction. 75% of beads run as single beads. B: The light scattering
characteristics of the beads after in vitro translation reaction. About 50% of beads fall into
the gate for single beads. C: Fluorescence from microbeads (gated for single beads only)
coated with T7-GFP gene and anti-GFP polyclonal antibody is significantly higher than
the signal from the beads where either the GFP gene or the anti-GFP antibody were
omitted.

Figure 4. Synthesis of Biotin-GS-DNP by the human glutathione S-transferase M2-2
(GST M2-2) catalysed reaction of 1-chloro-2,4-dinitrobenzene (CDNB; Sigma) with
reduced biotinylated-glutathione (Biotin-GSH).

Figure 5. Detecting paramagnetic beads coated with the product of an enzyme catalysed
reaction by flow cytometry. Sera-Mag™ streptavidin-coated magnetic microparticles were
incubated with Biotin-GS-DNP made by the GST M2-2 catalysed reaction of Biotin-GSH
and CDNB. The captured Biotin-GS-DNP was detected by incubation of the
microparticles with a mouse anti-dinitrophenol antibody followed by a (FITC)-conjugated
F(ab')2 fragment goat anti-mouse IgG, F(ab')2 fragment. After washing, 2 x 10^
microparticles were analysed by flow cytometry. All reagents, no reagents omitted from
the enzymatic synthesis of Biotin-GS-DNP; minus GST, the enzyme GST M2-2 was
omitted from the synthesis; minus biotin-GSH, biotin-GSH was omitted from the
synthesis; minus CDNB, CDNB was omitted from the synthesis.

Figure 6. Synthesis of MeNPO-CO-Biotin-β-Ala-GSH (caged-biotin-βAla-GSH).
Acetyl chloride (5 ml) was added to anhydrous methanol (80 ml). The stirred solution was
allowed to cool down and d-biotin (4 g) was added. After over-night stirring the solvents
were evaporated in vacuum to afford a white solid. The solid was triturated with ether,
filtered and dried under vacuum (in the presence of phosphorus pentoxide) and stored at
-20°C.



Figure 7. Reaction of caged-biotin-βAla-GSH with 1-chloro-2,4-dinitrobenzene (CDNB)
and photochemical uncaging of the biotin group.
Figure 8. Reaction of caged-biotin-βAla-GSH with 4-chloro-3-nitrobenzoate (CNB) and
photochemical uncaging of the biotin group.

Figure 9. Human GST M2-2 catalyses the reaction of caged-biotin-βAla-GSH with
CDNB and CNB in solution and the reaction products can be uncaged by UV irradiation,
captured on beads and detected using fluorescently labelled anti-product antibodies and
flow cytometry.

Panel A: light scattering characteristics of beads and gate for single beads (R1). Panel B:
fluorescence from microbeads (gated through R1) from reactions with CDNB. Panel C:
fluorescence from microbeads (gated through R1) from reactions with CNB. Signals
from microbeads from reactions with and without GST M2-2 are annotated +enz and -enz
respectively. Signals from microbeads from reactions which were UV irradiated and those
which were not are annotated +UV and -UV respectively.

Figure 10. Flow cytometry can be used to distinguish beads from aqueous compartments
of an emulsion containing GST M2-2 from beads from compartments without GST M2-2
by using caged-biotinylated-βAla-GSH and CNB as substrates.

Panel A: light scattering characteristics of a mixture of 1.0 µm diameter
nonfluorescent neutravidin labelled microspheres (Molecular Probes, F-8777) or 0.93 µm
diameter streptavidin-coated polystyrene beads (Bangs Laboratories) and gates set for
single Bangs beads (R1) and single Molecular Probes beads (R2). Panel B: fluorescence
from microbeads taken from a non-emulsified mixture of 98% Bangs beads (without
GST) and 2% Molecular Probes beads (with GST). Panel C: fluorescence from
microbeads taken from a mixture of two emulsions in a ratio of 98% emulsion containing
Bangs beads (without GST) and an emulsion containing 2% Molecular Probes beads
(with GST). Panel D: fluorescence from microbeads taken from a non-emulsified mixture
of 98% Molecular Probes beads (without GST) and 2% Bangs beads (with GST). Panel
E: fluorescence from microbeads taken from a mixture of two emulsions in a ratio of
98% emulsion containing Molecular Probes beads (without GST) and an emulsion
containing 2% Bangs beads (with GST). Fluorescence of ungated beads (No gate), beads
gated through R1 (R1) and beads gated through R2 (R2) are overlaid.
Figure 11. Human GST M2-2 transcribed and translated in vitro in the aqueous
compartments of a water-in-oil emulsion catalyses a reaction which gives rise to a change
in the fluorescence properties of co-compartmentalised microspheres.

Panel A: light scattering characteristics of beads and gate for single beads (R1). Panel B:
fluorescence from microbeads (gated through R1) from non-emulsified reactions. Panel
C: fluorescence from microbeads (gated through R1) from emulsified reactions. Signals from
microbeads from reactions with and without GSTM2-2.LMB2-3 DNA are annotated
+DNA and -DNA respectively. Signals from microbeads from reactions with and without
recombinant GST M2-2 are annotated +GST and -GST respectively.

Figure 12. Synthesis of the caged-biotinylated substrate EtNP-Bz-Glu-cagedBiotin (17).

Figure 13. Hydrolysis of the PTE substrate EtNP-Bz-Glu-cagedBiotin (17) to yield the
product Et-Bz-Glu-cagedBiotin, and uncaging of both substrate and product to yield the
corresponding biotinylated substrate (EtNP-Bz-Glu-Biotin) and product
(EtMP-Bz-Glu-Biotin).

Figure 14. Preparation of protein conjugates of a PTE substrate and product for
immunisation and ELISA.

Figure 15. PTE catalyses the reaction of EtMP-Bz-Glu-cagedBiotin in the presence of
streptavidin-coated beads, and the reaction products, uncaged by UV irradiation, are
captured on beads and detected using fluorescently labelled anti-product antibodies and
flow cytometry.

Panel A: light scattering characteristics of the beads and gate selected for single beads
(R2). Panel B: fluorescence from beads (gated through R2) from reactions with 10 µM
EtNP-Bz-Glu-cagedBiotin in the presence of in vitro translated OPD.LMB3-2biotin DNA
fragments (OPD) or M.HaeIII.LMB3-2biotin DNA fragments (M.HaeIII). Panel C: As B
but with 20 µM EtNP-Bz-Glu-cagedBiotin. Panel D: As B but with 50 µM
EtNP-Bz-Glu-cagedBiotin.

Figure 16. Reaction of EtNP-Bz-Glu-cagedBiotin in the presence of beads to which
genetic elements encoding the phosphotriesterase tagged with the Flag peptide
(N-Flag-OPD.LMB3-2biotin) or another enzyme (N-Flag-M.HaeIII.LMB3-2biotin) were attached
together with an antibody that binds the Flag peptide. The beads were reacted and
subsequently analysed by flow cytometry as described in the text.

Panel A: light scattering characteristics of beads and gate for single beads (R1). Panel B:
fluorescence from microbeads (gated through R1) to which were attached
N-Flag-OPD.LMB3-2biotin DNA fragments (OPD) or M.HaeIII.LMB3-2biotin DNA fragments
(M.HaeIII) from reactions with 12.5 µM EtNP-Bz-Glu-cagedBiotin. Panel C: As B but
with 25 µM EtNP-Bz-Glu-cagedBiotin.

Figure 17. E. coli BirA transcribed and translated in vitro catalyses a reaction which
gives rise to a change in the fluorescence properties of substrate-labelled microspheres in
the aqueous compartments of a water-in-oil emulsion and in bulk solution.

Figure 18. Flow cytometric analysis of samples prepared for the sorting experiment.

Figure 19. Fluorescence-activated flow cytometric sorting of the genetic elements.
Panel A: Samples #1 to #4 before sorting and after sorting. Panel B: Genes recovered
from individual beads sorted from sample #3 sorted into a 96-well plate. Panel C: Genes
recovered from individual beads sorted from sample #4 sorted into a 96-well plate. DNA
markers (M) are φX174-HaeIII digest.



(A) GENERAL DESCRIPTION 


The microcapsules of the present invention require appropriate physical properties to 
allow the working of the invention. 
First, to ensure that the genetic elements and gene products may not diffuse between
microcapsules, the contents of each microcapsule are preferably isolated from the contents
of the surrounding microcapsules, so that there is no or little exchange of the genetic
elements and gene products between the microcapsules over the timescale of the
experiment.

Second, the method of the present invention requires that there are only a limited number
of genetic elements per microcapsule. This ensures that the gene product of an individual
genetic element will be isolated from other genetic elements. Thus, coupling between
genetic element and gene product will be highly specific. The enrichment factor is
greatest with on average one or fewer genetic elements per microcapsule, the linkage
between nucleic acid and the activity of the encoded gene product being as tight as is
possible, since the gene product of an individual genetic element will be isolated from the
products of all other genetic elements. However, even if the theoretically optimal
situation of, on average, a single genetic element or less per microcapsule is not used, a
ratio of 5, 10, 50, 100 or 1000 or more genetic elements per microcapsule may prove
beneficial in sorting a large library. Subsequent rounds of sorting, including renewed
encapsulation with differing genetic element distribution, will permit more stringent
sorting of the genetic elements. Preferably, there is a single genetic element, or fewer, per
microcapsule.
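
The effect of the mean loading on how many microcapsules hold exactly one genetic element can be illustrated numerically. The sketch below assumes that genetic elements distribute randomly among microcapsules, so that occupancy follows a Poisson distribution; that assumption, and the function names, are illustrative and are not stated in the text.

```python
import math


def poisson_pmf(k: int, mean: float) -> float:
    """Probability of finding exactly k genetic elements in one microcapsule."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


def occupancy(mean: float) -> dict:
    """Summarise microcapsule occupancy for a mean loading (assumed Poisson)."""
    p_empty = poisson_pmf(0, mean)
    p_single = poisson_pmf(1, mean)
    p_multiple = 1.0 - p_empty - p_single
    return {
        "empty": p_empty,
        "single": p_single,
        "multiple": p_multiple,
        # Among occupied microcapsules, the share holding exactly one element.
        "single_among_occupied": p_single / (1.0 - p_empty),
    }


if __name__ == "__main__":
    for mean in (0.1, 1.0, 5.0):
        stats = occupancy(mean)
        print(mean, {name: round(value, 3) for name, value in stats.items()})
```

Under this assumption, a low mean loading leaves most microcapsules empty but makes nearly every occupied one hold a single element, whereas a mean of five puts several elements in most microcapsules, which is the case the text proposes to resolve by further rounds of sorting.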

Third, the formation and the composition of the microcapsules advantageously do not
abolish the function of the machinery for the expression of the genetic elements and the
activity of the gene products.

The appropriate system(s) may vary depending on the precise nature of the requirements 
in each application of the invention, as will be apparent to the skilled person. 

A wide variety of microencapsulation procedures are available (see Benita, 1996) and
may be used to create the microcapsules used in accordance with the present invention.
Indeed, more than 200 microencapsulation methods have been identified in the literature
(Finch, 1993).

These include membrane enveloped aqueous vesicles such as lipid vesicles (liposomes)
(New, 1990) and non-ionic surfactant vesicles (van Hal et al., 1996). These are
closed-membranous capsules of single or multiple bilayers of non-covalently assembled
molecules, with each bilayer separated from its neighbour by an aqueous compartment. In
the case of liposomes the membrane is composed of lipid molecules; these are usually
phospholipids but sterols such as cholesterol may also be incorporated into the
membranes (New, 1990). A variety of enzyme-catalysed biochemical reactions, including
RNA and DNA polymerisation, can be performed within liposomes (Chakrabarti et al.,
1994; Oberholzer et al., 1995a; Oberholzer et al., 1995b; Walde et al., 1994; Wick &
Luisi, 1996).

With a membrane-enveloped vesicle system much of the aqueous phase is outside the
vesicles and is therefore non-compartmentalised. This continuous, aqueous phase is
removed or the biological systems in it inhibited or destroyed (for example, by digestion
of nucleic acids with DNase or RNase) in order that the reactions are limited to the
microcapsules (Luisi et al., 1987).

Enzyme-catalysed biochemical reactions have also been demonstrated in microcapsules
generated by a variety of other methods. Many enzymes are active in reverse micellar
solutions (Bru & Walde, 1991; Bru & Walde, 1993; Creagh et al., 1993; Haber et al.,
1993; Kumar et al., 1989; Luisi & B., 1987; Mao & Walde, 1991; Mao et al., 1992; Perez
et al., 1992; Walde et al., 1994; Walde et al., 1993; Walde et al., 1988) such as the
AOT-isooctane-water system (Menger & Yamada, 1979).

Microcapsules can also be generated by interfacial polymerisation and interfacial
complexation (Whateley, 1996). Microcapsules of this sort can have rigid, nonpermeable
membranes, or semipermeable membranes. Semipermeable microcapsules bordered by
cellulose nitrate membranes, polyamide membranes and lipid-polyamide membranes can
all support biochemical reactions, including multienzyme systems (Chang, 1987; Chang,
1992; Lim, 1984). Alginate/polylysine microcapsules (Lim & Sun, 1980), which can be
formed under very mild conditions, have also proven to be very biocompatible, providing,
for example, an effective method of encapsulating living cells and tissues (Chang, 1992;
Sun et al., 1992).

Non-membranous microencapsulation systems based on phase partitioning of an aqueous 
environment in a colloidal system, such as an emulsion, may also be used. 

Preferably, the microcapsules of the present invention are formed from emulsions,
heterogeneous systems of two immiscible liquid phases with one of the phases dispersed
in the other as droplets of microscopic or colloidal size (Becher, 1957; Sherman, 1968;
Lissant, 1974; Lissant, 1984).



Emulsions may be produced from any suitable combination of immiscible liquids.
Preferably the emulsion of the present invention has water (containing the biochemical
components) as the phase present in the form of finely divided droplets (the disperse,
internal or discontinuous phase) and a hydrophobic, immiscible liquid (an 'oil') as the
matrix in which these droplets are suspended (the nondisperse, continuous or external
phase). Such emulsions are termed "water-in-oil" (W/O). This has the advantage that the
entire aqueous phase containing the biochemical components is compartmentalised in
discrete droplets (the internal phase). The external phase, being a hydrophobic oil,
generally contains none of the biochemical components and hence is inert.



The emulsion may be stabilised by addition of one or more surface-active agents
(surfactants). These surfactants are termed emulsifying agents and act at the water/oil
interface to prevent (or at least delay) separation of the phases. Many oils and many
emulsifiers can be used for the generation of water-in-oil emulsions; a recent compilation
listed over 16,000 surfactants, many of which are used as emulsifying agents (Ash and
Ash, 1993). Suitable oils include light white mineral oil; suitable non-ionic surfactants
(Schick, 1966) include sorbitan monooleate (Span™80; ICI) and
polyoxyethylenesorbitan monooleate (Tween™80; ICI).



WO 00/40712 PCT/GB00/00030

The use of anionic surfactants may also be beneficial. Suitable surfactants include sodium
cholate and sodium taurocholate. Particularly preferred is sodium deoxycholate,
preferably at a concentration of 0.5% w/v, or below. Inclusion of such surfactants can in
some cases increase the expression of the genetic elements and/or the activity of the gene
products. Addition of some anionic surfactants to a non-emulsified reaction mixture
completely abolishes translation. During emulsification, however, the surfactant is
transferred from the aqueous phase into the interface and activity is restored. Addition of
an anionic surfactant to the mixtures to be emulsified ensures that reactions proceed only
after compartmentalisation.

Creation of an emulsion generally requires the application of mechanical energy to force
the phases together. There are a variety of ways of doing this which utilise a variety of
mechanical devices, including stirrers (such as magnetic stir-bars, propeller and turbine
stirrers, paddle devices and whisks), homogenisers (including rotor-stator homogenisers,
high-pressure valve homogenisers and jet homogenisers), colloid mills, ultrasound and
'membrane emulsification' devices (Becher, 1957; Dickinson, 1994).

Aqueous microcapsules formed in water-in-oil emulsions are generally stable with little if
any exchange of genetic elements or gene products between microcapsules. Additionally,
we have demonstrated that several biochemical reactions proceed in emulsion
microcapsules. Moreover, complicated biochemical processes, notably gene transcription
and translation, are also active in emulsion microcapsules. The technology exists to create
emulsions with volumes all the way up to industrial scales of thousands of litres (Becher,
1957; Sherman, 1968; Lissant, 1974; Lissant, 1984).

The preferred microcapsule size will vary depending upon the precise requirements of any
individual selection process that is to be performed according to the present invention. In
all cases, there will be an optimal balance between gene library size, the required
enrichment and the required concentration of components in the individual microcapsules
to achieve efficient expression and reactivity of the gene products.

The processes of expression occur within each individual microcapsule provided by the
present invention. Both in vitro transcription and coupled transcription-translation
become less efficient at sub-nanomolar DNA concentrations. Because of the requirement
for only a limited number of DNA molecules to be present in each microcapsule, this
therefore sets a practical upper limit on the possible microcapsule size. Preferably, the
mean volume of the microcapsules is less than 5.2 × 10⁻¹⁶ m³ (corresponding to a
spherical microcapsule of diameter less than 10 µm), more preferably less than
6.5 × 10⁻¹⁷ m³ (5 µm diameter), more preferably about 4.2 × 10⁻¹⁸ m³ (2 µm diameter)
and ideally about 9 × 10⁻¹⁸ m³ (2.6 µm diameter).
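
The volumes quoted above follow from the volume of a sphere, V = πd³/6. As a quick check, the following sketch recomputes them from the stated diameters; the function name is illustrative only.

```python
import math


def sphere_volume_m3(diameter_um: float) -> float:
    """Volume in cubic metres of a sphere whose diameter is given in micrometres."""
    diameter_m = diameter_um * 1e-6
    return math.pi * diameter_m ** 3 / 6.0


if __name__ == "__main__":
    # Diameters quoted in the text for the preferred microcapsule sizes.
    for diameter_um in (10.0, 5.0, 2.6, 2.0):
        volume = sphere_volume_m3(diameter_um)
        print(f"{diameter_um:4.1f} um diameter -> {volume:.1e} m^3")
```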

The effective DNA or RNA concentration in the microcapsules may be artificially
increased by various methods that will be well-known to those versed in the art. These
include, for example, the addition of volume excluding chemicals such as polyethylene
glycols (PEG) and a variety of gene amplification techniques, including transcription
using RNA polymerases including those from bacteria such as E. coli (Roberts, 1969;
Blattner and Dahlberg, 1972; Roberts et al., 1975; Rosenberg et al., 1975), eukaryotes
(e.g. Weil et al., 1979; Manley et al., 1983) and bacteriophage such as T7, T3 and SP6
(Melton et al., 1984); the polymerase chain reaction (PCR) (Saiki et al., 1988); Qβ
replicase amplification (Miele et al., 1983; Cahill et al., 1991; Chetverin and Spirin, 1995;
Katanaev et al., 1995); the ligase chain reaction (LCR) (Landegren et al., 1988; Barany,
1991); the self-sustained sequence replication system (Fahy et al., 1991); and strand
displacement amplification (Walker et al., 1992). Gene amplification techniques requiring
thermal cycling such as PCR and LCR may be used if the emulsions and the in vitro
transcription or coupled transcription-translation systems are thermostable (for example,
the coupled transcription-translation systems can be made from a thermostable organism
such as Thermus aquaticus).

Increasing the effective local nucleic acid concentration enables larger microcapsules to
be used effectively. This allows a preferred practical upper limit to the microcapsule
volume of about 5.2 × 10⁻¹⁶ m³ (corresponding to a sphere of diameter 10 µm).

The microcapsule size is preferably sufficiently large to accommodate all of the required
components of the biochemical reactions that are needed to occur within the
microcapsule. For example, in vitro, both transcription reactions and coupled
transcription-translation reactions require a total nucleoside triphosphate concentration of
about 2 mM.

For example, in order to transcribe a gene to a single short RNA molecule of 500 bases in
length, this would require a minimum of 500 molecules of nucleoside triphosphate per
microcapsule (8.33 × 10⁻²² moles). In order to constitute a 2 mM solution, this number of
molecules is contained within a microcapsule of volume 4.17 × 10⁻¹⁹ litres (4.17 × 10⁻²²
m³), which if spherical would have a diameter of 93 nm.
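
The arithmetic behind this lower bound can be reproduced as follows. This is a minimal sketch: the function names are illustrative, and it uses Avogadro's number as 6.022 × 10²³ per mole, so the intermediate figures differ slightly from the rounded values in the text.

```python
import math

AVOGADRO = 6.022e23  # molecules per mole


def min_volume_litres(n_molecules: int, concentration_molar: float) -> float:
    """Smallest volume that holds n_molecules at the given molar concentration."""
    moles = n_molecules / AVOGADRO
    return moles / concentration_molar


def sphere_diameter_nm(volume_litres: float) -> float:
    """Diameter in nanometres of a sphere with the given volume in litres."""
    volume_m3 = volume_litres * 1e-3  # 1 litre = 1e-3 cubic metres
    diameter_m = (6.0 * volume_m3 / math.pi) ** (1.0 / 3.0)
    return diameter_m * 1e9


if __name__ == "__main__":
    # 500 nucleoside triphosphate molecules at 2 mM, as in the text.
    volume = min_volume_litres(500, 2e-3)
    print(f"volume   = {volume:.2e} litres")
    print(f"diameter = {sphere_diameter_nm(volume):.0f} nm")
```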

Furthermore, particularly in the case of reactions involving translation, it is to be noted
that the ribosomes necessary for the translation to occur are themselves approximately
20 nm in diameter. Hence, the preferred lower limit for microcapsules is a diameter of
approximately 0.1 µm (100 nm).

Therefore, the microcapsule volume is preferably of the order of between 5.2 × 10⁻²² m³
and 5.2 × 10⁻¹⁶ m³, corresponding to a sphere of diameter between 0.1 µm and 10 µm,
more preferably of between about 5.2 × 10⁻¹⁹ m³ and 6.5 × 10⁻¹⁷ m³ (1 µm and 5 µm).
Sphere diameters of about 2.6 µm are most advantageous.

It is no coincidence that the preferred dimensions of the compartments (droplets of 2.6 µm
mean diameter) closely resemble those of bacteria, for example, Escherichia are 1.1-1.5 ×
2.0-6.0 µm rods and Azotobacter are 1.5-2.0 µm diameter ovoid cells. In its simplest
form, Darwinian evolution is based on a 'one genotype one phenotype' mechanism. The
concentration of a single compartmentalised gene, or genome, drops from 0.4 nM in a
compartment of 2 µm diameter, to 25 pM in a compartment of 5 µm diameter. The
prokaryotic transcription/translation machinery has evolved to operate in compartments of
~1-2 µm diameter, where single genes are at approximately nanomolar concentrations. A
single gene, in a compartment of 2.6 µm diameter, is at a concentration of 0.2 nM. This
gene concentration is high enough for efficient translation. Compartmentalisation in such
a volume also ensures that even if only a single molecule of the gene product is formed it
is present at about 0.2 nM, which is important if the gene product is to have a modifying
activity of the genetic element itself. The volume of the microcapsule is thus selected
bearing in mind not only the requirements for transcription and translation of the genetic
element, but also the modifying activity required of the gene product in the method of the
invention.
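
The concentrations quoted above for a single compartmentalised gene can be checked by dividing one molecule (1/N_A moles) by the compartment volume. The sketch below assumes spherical compartments; the function name is illustrative.

```python
import math

AVOGADRO = 6.022e23  # molecules per mole


def single_molecule_conc_nM(diameter_um: float) -> float:
    """Concentration (nM) of one molecule in a spherical compartment of given diameter."""
    diameter_m = diameter_um * 1e-6
    volume_litres = math.pi * diameter_m ** 3 / 6.0 * 1e3  # m^3 -> litres
    molar = 1.0 / (AVOGADRO * volume_litres)
    return molar * 1e9


if __name__ == "__main__":
    # Compartment diameters discussed in the text.
    for diameter_um in (2.0, 2.6, 5.0):
        conc = single_molecule_conc_nM(diameter_um)
        print(f"{diameter_um:.1f} um -> {conc:.3f} nM")
```

The results round to the 0.4 nM, 0.2 nM and 25 pM figures given in the text for 2 µm, 2.6 µm and 5 µm compartments respectively.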

The size of emulsion microcapsules may be varied simply by tailoring the emulsion
conditions used to form the emulsion according to requirements of the selection system.
The larger the microcapsule size, the larger is the volume that will be required to
encapsulate a given genetic element library, since the ultimately limiting factor will be the
size of the microcapsule and thus the number of microcapsules possible per unit volume.

The size of the microcapsules is selected not only having regard to the requirements of the
transcription/translation system, but also those of the selection system employed for the
genetic element. Thus, the components of the selection system, such as a chemical
modification system, may require reaction volumes and/or reagent concentrations which
are not optimal for transcription/translation. As set forth herein, such requirements may
be accommodated by a secondary re-encapsulation step; moreover, they may be
accommodated by selecting the microcapsule size in order to maximise
transcription/translation and selection as a whole. Empirical determination of optimal
microcapsule volume and reagent concentration, for example as set forth herein, is
preferred.

A "genetic element" in accordance with the present invention is as described above.
Preferably, a genetic element is a molecule or construct selected from the group consisting
of a DNA molecule, an RNA molecule, a partially or wholly artificial nucleic acid
molecule consisting of exclusively synthetic or a mixture of naturally-occurring and
synthetic bases, any one of the foregoing linked to a polypeptide, and any one of the
foregoing linked to any other molecular group or construct. Advantageously, the other
molecular group or construct may be selected from the group consisting of nucleic acids,
polymeric substances, particularly beads, for example polystyrene beads, and magnetic or
paramagnetic substances such as magnetic or paramagnetic beads.

The nucleic acid portion of the genetic element may comprise suitable regulatory
sequences, such as those required for efficient expression of the gene product, for
example promoters, enhancers, translational initiation sequences, polyadenylation
sequences, splice sites and the like.

As will be apparent from the following, in many cases the polypeptide or other molecular
group or construct is a ligand or a substrate which directly or indirectly binds to or reacts
with the gene product in order to alter the optical properties of the genetic element. This
allows the sorting of the genetic element on the basis of the activity of the gene product.
The ligand or substrate can be connected to the nucleic acid by a variety of means that
will be apparent to those skilled in the art (see, for example, Hermanson, 1996).

One way in which the nucleic acid molecule may be linked to a ligand or substrate is
through biotinylation. This can be done by PCR amplification with a 5'-biotinylation
primer such that the biotin and nucleic acid are covalently linked.

The ligand or substrate can be attached to the modified nucleic acid by a variety of means
that will be apparent to those of skill in the art (see, for example, Hermanson, 1996). A
biotinylated nucleic acid may be coupled to a polystyrene or paramagnetic microbead
(0.02 to approx. 5.0 µm in diameter) that is coated with avidin or streptavidin, that will
therefore bind the nucleic acid with very high affinity. This bead can be derivatised with
substrate or ligand by any suitable method such as by adding biotinylated substrate or by
covalent coupling.

Alternatively, a biotinylated nucleic acid may be coupled to avidin or streptavidin
complexed to a large protein molecule such as thyroglobulin (669 Kd) or ferritin (440
Kd). This complex can be derivatised with substrate or ligand, for example by covalent
coupling to the ε-amino group of lysines or through a non-covalent interaction such as
biotin-avidin.

The substrate may be present in a form unlinked to the genetic element but containing an
inactive "tag" that requires a further step to activate it such as photoactivation (e.g. of a
"caged" biotin analogue, (Sundberg et al., 1995; Pirrung and Huang, 1996)). The catalyst
to be selected then converts the substrate to product. The "tag" is then activated and the
"tagged" substrate and/or product bound by a tag-binding molecule (e.g. avidin or
streptavidin) complexed with the nucleic acid. The ratio of substrate to product attached
to the nucleic acid via the "tag" will therefore reflect the ratio of the substrate and product
in solution.

An alternative is to couple the nucleic acid to a product-specific antibody (or other
product-specific molecule). In this scenario, the substrate (or one of the substrates) is
present in each microcapsule unlinked to the genetic element, but has a molecular "tag"
(for example biotin, DIG or DNP or a fluorescent group). When the catalyst to be
selected converts the substrate to product, the product retains the "tag" and is then
captured in the microcapsule by the product-specific antibody. In this way the genetic
element only becomes associated with the "tag" when it encodes or produces an enzyme
capable of converting substrate to product.

The terms "isolating", "sorting" and "selecting", as well as variations thereof, are used
herein. Isolation, according to the present invention, refers to the process of separating an
entity from a heterogeneous population, for example a mixture, such that it is free of at
least one substance with which it was associated before the isolation process. In a
preferred embodiment, isolation refers to purification of an entity essentially to
homogeneity. Sorting of an entity refers to the process of preferentially isolating desired
entities over undesired entities. In as far as this relates to isolation of the desired entities,
the terms "isolating" and "sorting" are equivalent. The method of the present invention
permits the sorting of desired genetic elements from pools (libraries or repertoires) of
genetic elements which contain the desired genetic element. Selecting is used to refer to
the process (including the sorting process) of isolating an entity according to a particular
property thereof.

In a highly preferred application, the method of the present invention is useful for sorting
libraries of genetic elements. The invention accordingly provides a method according to
preceding aspects of the invention, wherein the genetic elements are isolated from a
library of genetic elements encoding a repertoire of gene products. Herein, the terms
"library", "repertoire" and "pool" are used according to their ordinary signification in the
art, such that a library of genetic elements encodes a repertoire of gene products. In
general, libraries are constructed from pools of genetic elements and have properties
which facilitate sorting.

Initial selection of a genetic element from a genetic element library using the present
invention will in most cases require the screening of a large number of variant genetic
elements. Libraries of genetic elements can be created in a variety of different ways,
including the following.

Pools of naturally occurring genetic elements can be cloned from genomic DNA or cDNA
(Sambrook et al., 1989); for example, phage antibody libraries, made by PCR
amplification of repertoires of antibody genes from immunised or unimmunised donors, have
proved very effective sources of functional antibody fragments (Winter et al., 1994;
Hoogenboom, 1997). Libraries of genes can also be made by encoding all (see for
example Smith, 1985; Parmley and Smith, 1988) or part of genes (see for example
Lowman et al., 1991) or pools of genes (see for example Nissim et al., 1994) by a
randomised or doped synthetic oligonucleotide. Libraries can also be made by
introducing mutations into a genetic element or pool of genetic elements 'randomly' by a
variety of techniques in vivo, including: using mutator strains of bacteria such as E. coli
mutD5 (Liao et al., 1986; Yamagishi et al., 1990; Low et al., 1996); using the antibody
hypermutation system of B-lymphocytes (Yelamos et al., 1995). Random mutations can
also be introduced both in vivo and in vitro by chemical mutagens, and ionising or UV
irradiation (see Friedberg et al., 1995), or incorporation of mutagenic base analogues
(Freese, 1959; Zaccolo et al., 1996). 'Random' mutations can also be introduced into
genes in vitro during polymerisation, for example by using error-prone polymerases
(Leung et al., 1989).

Further diversification can be introduced by using homologous recombination either in
vivo (see Kowalczykowski et al., 1994) or in vitro (Stemmer, 1994a; Stemmer, 1994b).

According to a further aspect of the present invention, therefore, there is provided a
method of in vitro evolution comprising the steps of:

(a) selecting one or more genetic elements from a genetic element library according to
the present invention;

(b) mutating the selected genetic element(s) in order to generate a further library of
genetic elements encoding a repertoire of gene products; and

(c) iteratively repeating steps (a) and (b) in order to obtain a gene product with
enhanced activity.

Mutations may be introduced into the genetic element(s) as set forth above.
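
The iterative cycle of steps (a) to (c) can be sketched as a toy simulation. Everything in this example is hypothetical and for illustration only: genetic elements are modelled as short strings, `activity` is an arbitrary scoring function standing in for the optical readout, and mutation is simple random base substitution. It is not the selection procedure of the invention itself, which sorts compartmentalised genetic elements by their optical properties.

```python
import random

ALPHABET = "ACGT"


def mutate(sequence: str, rate: float, rng: random.Random) -> str:
    """Replace each base with a randomly drawn base with probability `rate`."""
    return "".join(
        rng.choice(ALPHABET) if rng.random() < rate else base for base in sequence
    )


def evolve(library, activity, rounds, keep, offspring, rate, rng):
    """Repeat selection (a) and mutation (b) for a number of rounds (c)."""
    for _ in range(rounds):
        # (a) select the elements whose gene products score highest
        selected = sorted(library, key=activity, reverse=True)[:keep]
        # (b) mutate the selected elements to generate a further library;
        #     the parents are retained so the best score never decreases
        library = selected + [
            mutate(parent, rate, rng) for parent in selected for _ in range(offspring)
        ]
    return max(library, key=activity)


if __name__ == "__main__":
    rng = random.Random(0)
    target = "ACGTACGTAC"  # hypothetical optimum, used only to score variants

    def activity(sequence: str) -> int:
        return sum(a == b for a, b in zip(sequence, target))

    start = ["".join(rng.choice(ALPHABET) for _ in range(10)) for _ in range(20)]
    best = evolve(start, activity, rounds=5, keep=5, offspring=3, rate=0.1, rng=rng)
    print(best, activity(best))
```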

The genetic elements according to the invention advantageously encode enzymes,
preferably of pharmacological or industrial interest, activators or inhibitors, especially of
biological systems, such as cellular signal transduction mechanisms, antibodies and
fragments thereof, and other binding agents (e.g. transcription factors) suitable for
diagnostic and therapeutic applications. In a preferred aspect, therefore, the invention
permits the identification and isolation of clinically or industrially useful products. In a
further aspect of the invention, there is provided a product when isolated by the method of
the invention.

The selection of suitable encapsulation conditions is desirable. Depending on the
complexity and size of the library to be screened, it may be beneficial to set up the
encapsulation procedure such that 1 or less than 1 genetic element is encapsulated per
microcapsule. This will provide the greatest power of resolution. Where the library is
larger and/or more complex, however, this may be impracticable; it may be preferable to
encapsulate several genetic elements together and rely on repeated application of the
method of the invention to achieve sorting of the desired activity. A combination of
encapsulation procedures may be used to obtain the desired enrichment.

Theoretical studies indicate that the larger the number of genetic element variants created
the more likely it is that a molecule will be created with the properties desired (see
Perelson and Oster, 1979 for a description of how this applies to repertoires of
antibodies). Recently it has also been confirmed practically that larger phage-antibody
repertoires do indeed give rise to more antibodies with better binding affinities than
smaller repertoires (Griffiths et al., 1994). To ensure that rare variants are generated and
thus are capable of being selected, a large library size is desirable. Thus, the use of
optimally small microcapsules is beneficial.

The largest repertoire created to date using methods that require an in vivo step
(phage-display and LacI systems) has been a 1.6 × 10¹¹ clone phage-peptide library which
required the fermentation of 15 litres of bacteria (Fisch et al., 1996). SELEX experiments
are often carried out on very large numbers of variants (up to 10¹⁵).

Using the present invention, at a preferred microcapsule diameter of 2.6 µm, a repertoire
size of at least 10¹¹ can be selected using 1 ml aqueous phase in a 20 ml emulsion.
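
The repertoire size quoted above can be estimated by dividing the aqueous volume by the volume of one microcapsule. The sketch below assumes uniformly sized spherical droplets of 2.6 µm diameter and roughly one genetic element per droplet; both are simplifying assumptions, and the function name is illustrative.

```python
import math


def droplets_per_aqueous_volume(aqueous_ml: float, diameter_um: float) -> float:
    """Number of uniform spherical droplets formed from a given aqueous volume."""
    aqueous_m3 = aqueous_ml * 1e-6  # 1 ml = 1e-6 cubic metres
    diameter_m = diameter_um * 1e-6
    droplet_m3 = math.pi * diameter_m ** 3 / 6.0
    return aqueous_m3 / droplet_m3


if __name__ == "__main__":
    # 1 ml of aqueous phase dispersed as 2.6 um droplets, as in the text.
    count = droplets_per_aqueous_volume(1.0, 2.6)
    print(f"{count:.2e} microcapsules")
```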

In addition to the genetic elements described above, the microcapsules according to the
invention will comprise further components required for the sorting process to take place.

Other components of the system will for example comprise those necessary for
transcription and/or translation of the genetic element. These are selected for the
requirements of a specific system from the following: a suitable buffer, an in vitro
transcription/replication system and/or an in vitro translation system containing all the
necessary ingredients, enzymes and cofactors, RNA polymerase, nucleotides, nucleic
acids (natural or synthetic), transfer RNAs, ribosomes and amino acids, and the substrates
of the reaction of interest in order to allow selection of the modified gene product.

A suitable buffer will be one in which all of the desired components of the biological
system are active and will therefore depend upon the requirements of each specific
reaction system. Buffers suitable for biological and/or chemical reactions are known in
the art and recipes provided in various laboratory texts, such as Sambrook et al., 1989.

The in vitro translation system will usually comprise a cell extract, typically from bacteria
(Zubay, 1973; Zubay, 1980; Lesley et al., 1991; Lesley, 1995), rabbit reticulocytes
(Pelham and Jackson, 1976), or wheat germ (Anderson et al., 1983). Many suitable
systems are commercially available (for example from Promega) including some which
will allow coupled transcription/translation (all the bacterial systems and the reticulocyte
and wheat germ TNT™ extract systems from Promega). The mixture of amino acids used
may include synthetic amino acids if desired, to increase the possible number or variety of
proteins produced in the library. This can be accomplished by charging tRNAs with
artificial amino acids and using these tRNAs for the in vitro translation of the proteins to
be selected (Ellman et al., 1991; Benner, 1994; Mendel et al., 1995).

After each round of selection the enrichment of the pool of genetic elements for those
encoding the molecules of interest can be assayed by non-compartmentalised in vitro
transcription/replication or coupled transcription-translation reactions. The selected pool
is cloned into a suitable plasmid vector and RNA or recombinant protein is produced from
the individual clones for further purification and assay.

In a preferred aspect, the internal environment of a microcapsule may be altered by
addition of reagents to the oil phase of the emulsion. The reagents diffuse through the oil
phase to the aqueous microcapsule environment. Preferably, the reagents are at least
partly water-soluble, such that a proportion thereof is distributed from the oil phase to the
aqueous microcapsule environment. Advantageously, the reagents are substantially
insoluble in the oil phase. Reagents are preferably mixed into the oil phase by mechanical
mixing, for example vortexing.

The reagents which may be added via the oil phase include substrates, buffering 
components, factors and the like. In particular, the internal pH of microcapsules may be 
altered in situ by adding acidic or basic components to the oil phase. 

The invention moreover relates to a method for producing a gene product, once a genetic
element encoding the gene product has been sorted by the method of the invention.
Clearly, the genetic element itself may be directly expressed by conventional means to
produce the gene product. However, alternative techniques may be employed, as will be
apparent to those skilled in the art. For example, the genetic information incorporated in
the gene product may be incorporated into a suitable expression vector, and expressed
therefrom.

The invention also describes the use of conventional screening techniques to identify
compounds which are capable of interacting with the gene products identified by the first
aspect of the invention. In preferred embodiments, gene product encoding nucleic acid is
incorporated into a vector, and introduced into suitable host cells to produce transformed
cell lines that express the gene product. The resulting cell lines can then be produced for
reproducible qualitative and/or quantitative analysis of the effect(s) of potential drugs
affecting gene product function. Thus gene product expressing cells may be employed for
the identification of compounds, particularly small molecular weight compounds, which
modulate the function of gene product. Thus host cells expressing gene product are useful
for drug screening and it is a further object of the present invention to provide a method
for identifying compounds which modulate the activity of the gene product, said method
comprising exposing cells containing heterologous DNA encoding gene product, wherein
said cells produce functional gene product, to at least one compound or mixture of
compounds or signal whose ability to modulate the activity of said gene product is sought
to be determined, and thereafter monitoring said cells for changes caused by said
modulation. Such an assay enables the identification of modulators, such as agonists,
antagonists and allosteric modulators, of the gene product. As used herein, a compound
or signal that modulates the activity of gene product refers to a compound that alters the
activity of gene product in such a way that the activity of the gene product is different in
the presence of the compound or signal (as compared to the absence of said compound or
signal).

Cell-based screening assays can be designed by constructing cell lines in which the
expression of a reporter protein, i.e. an easily assayable protein, such as β-galactosidase,
chloramphenicol acetyltransferase (CAT), green fluorescent protein (GFP) or luciferase, is
dependent on gene product. Such an assay enables the detection of compounds that
directly modulate gene product function, such as compounds that antagonise gene
product, or compounds that inhibit or potentiate other cellular functions required for the
activity of gene product.

The present invention also provides a method to exogenously affect gene product 
dependent processes occurring in cells. Recombinant gene product producing host cells,
e.g. mammalian cells, can be contacted with a test compound, and the modulating 
effect(s) thereof can then be evaluated by comparing the gene product-mediated response 
in the presence and absence of test compound, or relating the gene product-mediated 
response of test cells, or control cells (i.e., cells that do not express gene product), to the 
presence of the compound.

In a further aspect, the invention relates to a method for optimising a production process 
which involves at least one step which is facilitated by a polypeptide. For example, the 
step may be a catalytic step, which is facilitated by an enzyme. Thus, the invention 
provides a method for preparing a compound or compounds comprising the steps of:

(a) providing a synthesis protocol wherein at least one step is facilitated by a 
polypeptide; 

(b) preparing genetic elements encoding variants of the polypeptide which facilitates 
this step, the expression of which may result, directly or indirectly, in the modification
of the optical properties of the genetic elements;

(c) compartmentalising genetic elements into microcapsules; 

(d) expressing the genetic elements to produce their respective gene products within 
the microcapsules; 

(e) sorting the genetic elements which produce polypeptide gene product(s) having the 
desired activity using the changed optical properties of the genetic elements; and

(f) preparing the compound or compounds using the polypeptide gene product 
identified in (e) to facilitate the relevant step of the synthesis.

By means of the invention, enzymes involved in the preparation of a compound may be 
optimised by selection for optimal activity. The procedure involves the preparation of
variants of the polypeptide to be screened, which equate to a library of polypeptides as
referred to herein. The variants may be prepared in the same manner as the libraries
discussed elsewhere herein. 



(B) SELECTION PROCEDURES

The system can be configured to select for RNA, DNA or protein gene product molecules 
with catalytic, regulatory or binding activity.

(i) SELECTION FOR BINDING

In the case of selection for a gene product with affinity for a specific ligand the genetic 
element may be linked to the gene product in the microcapsule via the ligand. Only gene 
products with affinity for the ligand will therefore bind to the genetic element and only 
those genetic elements with gene product bound via the ligand will acquire the changed
optical properties which enable them to be retained in the selection step. In this
embodiment, the genetic element will thus comprise a nucleic acid encoding the gene
product linked to a ligand for the gene product.

The change in optical properties of the genetic element after binding of the gene product
to the ligand may be induced in a variety of ways, including:

(1) the gene product itself may have distinctive optical properties, for example, it is 
fluorescent (e.g. green fluorescent protein (Lorenz et al., 1991)).

(2) the optical properties of the gene product may be modified on binding to the ligand,
for example, the fluorescence of the gene product is quenched or enhanced on binding
(Guixe et al., 1998; Qi and Grabowski, 1998).

(3) the optical properties of the ligand may be modified on binding of the gene product; 
for example, the fluorescence of the ligand is quenched or enhanced on binding (Voss, 
1993; Masui and Kuramitsu, 1998). 

(4) the optical properties of both ligand and gene product are modified on binding, for
example, there can be a fluorescence resonance energy transfer (FRET) from ligand to
gene product (or vice versa) resulting in emission at the "acceptor" emission
wavelength when excitation is at the "donor" absorption wavelength (Heim & Tsien,
1996; Mahajan et al., 1998; Miyawaki et al., 1997).

In this embodiment, it is not necessary for binding of the gene product to the genetic 
element via the ligand to directly induce a change in optical properties. All the gene
products to be selected can contain a putative binding domain, which is to be selected for,
and a common feature - a tag. The genetic element in each microcapsule is physically
linked to the ligand. If the gene product produced from the genetic element has affinity
for the ligand, it will bind to it and become physically linked to the same genetic element
that encoded it, resulting in the genetic element being 'tagged'. At the end of the reaction,
all of the microcapsules are combined, and all genetic elements and gene products pooled
together in one environment. Genetic elements encoding gene products exhibiting the
desired binding can be selected by adding reagents which specifically bind to, or react
specifically with, the "tag" and thereby induce a change in the optical properties of the
genetic element allowing their sorting. For example, a fluorescently-labelled anti-"tag"
antibody can be used, or an anti-"tag" antibody followed by a second fluorescently 
labelled antibody which binds the first. 

In an alternative embodiment, genetic elements may be sorted on the basis that the gene 
product, which binds to the ligand, merely hides the ligand from, for example, further
binding partners which would otherwise modify the optical properties of the genetic 
element. In this case genetic elements with unmodified optical properties would be 
selected. 

In an alternative embodiment, the invention provides a method according to the first
aspect of the invention, wherein in step (b) the gene products bind to genetic elements 
encoding them. The gene products together with the attached genetic elements are then 
sorted as a result of binding of a ligand to gene products having the desired binding 
activity. For example, all gene products can contain an invariant region which binds
covalently or non-covalently to the genetic element, and a second region which is
diversified so as to generate the desired binding activity.

In an alternative embodiment, the ligand for the gene product is itself encoded by the
genetic element and binds to the genetic element. Stated otherwise, the genetic element 
encodes two (or indeed more) gene products, at least one of which binds to the genetic 
element, and which can potentially bind each other. Only when the gene products interact 
in a microcapsule is the genetic element modified in a way that ultimately results in a
change in its optical properties that enables it to be sorted. This embodiment,
for example, is used to search gene libraries for pairs of genes encoding pairs of proteins
which bind each other.

Fluorescence may be enhanced by the use of Tyramide Signal Amplification (TSA™)
to make the genetic elements fluorescent. This involves peroxidase (linked
to another protein) binding to the genetic elements and catalysing the conversion of
fluorescein-tyramine into a free radical form which then reacts (locally) with the genetic
elements. Methods for performing TSA are known in the art, and kits are available
commercially from NEN.

TSA may be configured such that it results in a direct increase in the fluorescence of the genetic 
element, or such that a ligand is attached to the genetic element which is bound by a second 
fluorescent molecule, or a sequence of molecules, one or more of which is fluorescent.

(ii) SELECTION FOR CATALYSIS 

When selection is for catalysis, the genetic element in each microcapsule may comprise 
the substrate of the reaction. If the genetic element encodes a gene product capable of 
acting as a catalyst, the gene product will catalyse the conversion of the substrate into the
product. Therefore, at the end of the reaction the genetic element is physically linked to 
the product of the catalysed reaction. 

It may also be desirable, in some cases, for the substrate not to be a component of the 
genetic element. In this case the substrate would contain an inactive "tag" that requires a
further step to activate it such as photoactivation (e.g. of a "caged" biotin analogue,
(Sundberg et al., 1995; Pirrung and Huang, 1996)). The catalyst to be selected then
converts the substrate to product. The "tag" is then activated and the "tagged" substrate
and/or product bound by a tag-binding molecule (e.g. avidin or streptavidin) complexed
with the nucleic acid. The ratio of substrate to product attached to the nucleic acid via the 
"tag" will therefore reflect the ratio of the substrate and product in solution. 

The optical properties of genetic elements with product attached and which encode gene
products with the desired catalytic activity can be modified by either: 

(1) the product-genetic element complex having characteristic optical properties not 
found in the substrate-genetic element complex, due to, for example:

(a) the substrate and product having different optical properties (many fluorogenic
enzyme substrates are available commercially (see for example Haugland, 1996)
including substrates for glycosidases, phosphatases, peptidases and proteases (Craig
et al., 1995; Huang et al., 1992; Brynes et al., 1982; Jones et al., 1997; Matayoshi et
al., 1990; Wang et al., 1990)), or

(b) the substrate and product having similar optical properties, but only the product,
and not the substrate binds to, or reacts with, the genetic element;

(2) adding reagents which specifically bind to, or react with, the product and which 
thereby induce a change in the optical properties of the genetic elements allowing their 
sorting (these reagents can be added before or after breaking the microcapsules and 
pooling the genetic elements). The reagents:

(a) bind specifically to, or react specifically with, the product, and not the substrate,
if both substrate and product are attached to the genetic element, or
(b) optionally bind both substrate and product if only the product, and not the 
substrate binds to, or reacts with, the genetic element. 

The pooled genetic elements encoding catalytic molecules can then be enriched by
selecting for the genetic elements with modified optical properties. 

An alternative is to couple the nucleic acid to a product-specific antibody (or other 
product-specific molecule). In this scenario, the substrate (or one of the substrates) is 
present in each microcapsule unlinked to the genetic element, but has a molecular "tag"
(for example biotin, DIG or DNP or a fluorescent group). When the catalyst to be 
selected converts the substrate to product, the product retains the "tag" and is then
captured in the microcapsule by the product-specific antibody. In this way the genetic
element only becomes associated with the "tag" when it encodes or produces an enzyme
capable of converting substrate to product. When all reactions are stopped and the 
microcapsules are combined, the genetic elements encoding active enzymes will be 

"tagged" and may already have changed optical properties, for example, if the "tag" was a
fluorescent group. Alternatively, a change in optical properties of "tagged" genes can be
induced by adding a fluorescently labelled ligand which binds the "tag" (for example
fluorescently-labelled avidin/streptavidin, an anti-"tag" antibody which is fluorescent, or a
non-fluorescent anti-"tag" antibody which can be detected by a second
fluorescently-labelled antibody).

Alternatively, selection may be performed indirectly by coupling a first reaction to 
subsequent reactions that take place in the same microcapsule. There are two general
ways in which this may be performed. In a first embodiment, the product of the first
reaction is reacted with, or bound by, a molecule which does not react with the substrate
of the first reaction. A second, coupled reaction will only proceed in the presence of the
product of the first reaction. A genetic element encoding a gene product with a desired
activity can then be purified by using the properties of the product of the second reaction 
to induce a change in the optical properties of the genetic element as above. 

Alternatively, the product of the reaction being selected may be the substrate or
cofactor for a second enzyme-catalysed reaction. The enzyme to catalyse the second
reaction can either be translated in situ in the microcapsules or incorporated in the
reaction mixture prior to microencapsulation. Only when the first reaction proceeds will
the coupled enzyme generate a product which can be used to induce a change in the
optical properties of the genetic element as above. 

This concept of coupling can be elaborated to incorporate multiple enzymes, each using as 
a substrate the product of the previous reaction. This allows for selection of enzymes that 
will not react with an immobilised substrate. It can also be designed to give increased
sensitivity by signal amplification if a product of one reaction is a catalyst or a cofactor 
for a second reaction or series of reactions leading to a selectable product (for example,
see Johannsson and Bates, 1988; Johannsson, 1991). Furthermore an enzyme cascade
system can be based on the production of an activator for an enzyme or the destruction of
an enzyme inhibitor (see Mize et al., 1989). Coupling also has the advantage that a
common selection system can be used for a whole group of enzymes which generate the
same product and allows for the selection of complicated chemical transformations that
cannot be performed in a single step. 

Such a method of coupling thus enables the evolution of novel "metabolic pathways" in 
vitro in a stepwise fashion, selecting and improving first one step and then the next. The
selection strategy is based on the final product of the pathway, so that all earlier steps can
be evolved independently or sequentially without setting up a new selection system for 
each step of the reaction. 



Expressed in an alternative manner, there is provided a method of isolating one or more 
genetic elements encoding a gene product having a desired catalytic activity, comprising
the steps of: 

(1) expressing genetic elements to give their respective gene products; 

(2) allowing the gene products to catalyse conversion of a substrate to a product, which 
may or may not be directly selectable, in accordance with the desired activity; 

(3) optionally coupling the first reaction to one or more subsequent reactions, each
reaction being modulated by the product of the previous reactions, and leading to the 
creation of a final, selectable product; 

(4) linking the selectable product of catalysis to the genetic elements by either:

a) coupling a substrate to the genetic elements in such a way that the product
remains associated with the genetic elements, or

b) reacting or binding the selectable product to the genetic elements by way of a 
suitable molecular "tag" attached to the substrate which remains on the product, 
or 

c) coupling the selectable product (but not the substrate) to the genetic elements 
by means of a product-specific reaction or interaction with the product; and

(5) selecting the product of catalysis, together with the genetic element to which it is 
bound, either by means of its characteristic optical properties, or by adding reagents
which specifically bind to, or react specifically with, the product and which thereby
induce a change in the optical properties of the genetic elements, wherein in steps (1) to
(4) each genetic element and respective gene product is contained within a
microcapsule. 


(iii) SELECTING FOR ENZYME SUBSTRATE SPECIFICITY/SELECTIVITY 

Genetic elements encoding enzymes with substrate specificity or selectivity can be 
specifically enriched by carrying out a positive selection for reaction with one substrate
and a negative selection for reaction with another substrate. Such combined positive and
negative selection pressure should be of great importance in isolating regio-selective and
stereo-selective enzymes (for example, enzymes that can distinguish between two
enantiomers of the same substrate). For example, two substrates (e.g. two different
enantiomers) are each labelled with different tags (e.g. two different fluorophores) such
that the tags become attached to the genetic element by the enzyme-catalysed reaction. If
the two tags confer different optical properties on the genetic element the substrate
specificity of the enzyme can be determined from the optical properties of the genetic
element and those genetic elements encoding gene products with the wrong (or no)
specificity rejected. Tags conferring no change in optical activity can also be used if
tag-specific ligands with different optical properties are added (e.g. tag-specific antibodies
labelled with different fluorophores).
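
By way of illustration only, the combined positive and negative selection described above can be pictured as a decision rule on two fluorescence channels. The following Python sketch is not part of the disclosed method. The `Bead` type, the channel names and the threshold values `min_wanted` and `min_ratio` are hypothetical and would need to be chosen for the particular tags and instrument used.

```python
from dataclasses import dataclass


@dataclass
class Bead:
    """Fluorescence of one genetic element in two detection channels."""
    wanted: float    # signal from the tag carried by the desired substrate
    unwanted: float  # signal from the tag carried by the other substrate


def is_selective(bead, min_wanted=100.0, min_ratio=10.0):
    """Positive selection on the wanted tag, negative selection on the other.

    Both thresholds are illustrative assumptions, not values from the text.
    """
    if bead.wanted < min_wanted:
        # No, or too little, reaction with the desired substrate.
        return False
    # Compare by multiplication so a zero unwanted signal needs no special case.
    return bead.wanted >= min_ratio * bead.unwanted


beads = [Bead(500, 20), Bead(500, 400), Bead(30, 1), Bead(800, 0)]
kept = [b for b in beads if is_selective(b)]
```

In this toy data set only the first and last beads are retained: the second reacts with both substrates (wrong specificity) and the third shows too little activity.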

(iv) SELECTION FOR REGULATION 

A similar system can be used to select for regulatory properties of enzymes.

In the case of selection for a regulator molecule which acts as an activator or inhibitor of a 
biochemical process, the components of the biochemical process can either be translated 
in situ in each microcapsule or can be incorporated in the reaction mixture prior to
microencapsulation. If the genetic element being selected is to encode an activator,
selection can be performed for the product of the regulated reaction, as described above in
connection with catalysis. If an inhibitor is desired, selection can be for a chemical
property specific to the substrate of the regulated reaction. 

There is therefore provided a method of sorting one or more genetic elements coding for a 
gene product exhibiting a desired regulatory activity, comprising the steps of:

(1) expressing genetic elements to give their respective gene products; 

(2) allowing the gene products to activate or inhibit a biochemical reaction, or
sequence of coupled reactions, in accordance with the desired activity, in such a way as 
to allow the generation or survival of a selectable molecule; 

(3) linking the selectable molecule to the genetic elements either by

a) having the selectable molecule, or the substrate from which it derives, 
attached to the genetic elements, or 

b) reacting or binding the selectable product to the genetic elements, by way of a 
suitable molecular "tag" attached to the substrate which remains on the product,
or

c) coupling the product of catalysis (but not the substrate) to the genetic 
elements, by means of a product-specific reaction or interaction with the product; 

(4) selecting the selectable product, together with the genetic element to which it is 
bound, either by means of its characteristic optical properties, or by adding reagents 
which specifically bind to, or react specifically with, the product and which thereby
induce a change in the optical properties of the genetic elements, wherein in steps (1) to
(3) each genetic element and respective gene product is contained within a
microcapsule. 

(v) SELECTION FOR OPTICAL PROPERTIES OF THE GENE PRODUCT

It is possible to select for inherent optical properties of gene products if, in the 
microcapsules, the gene product binds back to the genetic element, for example through a 
common element of the gene product which binds to a ligand which is part of the genetic 
element. After pooling the genetic elements they can then be sorted using the optical
properties of the bound gene products. This embodiment can be used, for example, to
select variants of green fluorescent protein (GFP) (Cormack et al., 1996; Delagrave et al.,
1995; Ehrig et al., 1995), with improved fluorescence and/or novel absorption and
emission spectra.

(vi) FLOW SORTING OF GENETIC ELEMENTS 


In a preferred embodiment of the invention the genetic elements will be sorted by flow 
cytometry. A variety of optical properties can be used to trigger sorting, including light 
scattering (Kerker, 1983) and fluorescence polarisation (Rolland et al., 1985). In a highly 
preferred embodiment the difference in optical properties of the genetic elements will be a
difference in fluorescence and the genetic elements will be sorted using a fluorescence
activated cell sorter (Norman, 1980; Mackenzie and Pinder, 1986), or similar device. In
an especially preferred embodiment the genetic element comprises a nonfluorescent
nonmagnetic (e.g. polystyrene) or paramagnetic microbead (see Fornusek and Vetvicka,
1986), optimally 0.6 to 1.0 µm diameter, to which are attached both the gene and the
groups involved in generating a fluorescent signal:

(1) commercially available fluorescence activated cell sorting equipment from 
established manufacturers (e.g. Becton-Dickinson, Coulter) allows the sorting of up to 
10^8 genetic elements (events) per hour;

(2) the fluorescence signal from each bead corresponds tightly to the number of 
fluorescent molecules attached to the bead. At present as few as a few hundred
fluorescent molecules per particle can be quantitatively detected;

(3) the wide dynamic range of the fluorescence detectors (typically 4 log units) allows 
easy setting of the stringency of the sorting procedure, thus allowing the recovery of 
the optimal number of genetic elements from the starting pool (the gates can be set to
separate beads with small differences in fluorescence or to only separate out beads
with large differences in fluorescence, dependent on the selection being performed);

(4) commercially available fluorescence-activated cell sorting equipment can perform
simultaneous excitation at up to two different wavelengths and detect fluorescence at
up to four different wavelengths (Shapiro, 1983) allowing positive and negative
selections to be performed simultaneously by monitoring the labelling of the genetic
element with two (or more) different fluorescent markers, for example, if two
alternative substrates for an enzyme (e.g. two different enantiomers) are labelled with
different fluorescent tags the genetic element can be labelled with different fluorophores
dependent on the substrate used and only genes encoding enzymes with
enantioselectivity selected;

(5) highly uniform derivatised and non-derivatised nonmagnetic and paramagnetic 
microparticles (beads) are commercially available from many sources (e.g. Sigma, and
Molecular Probes) (Fornusek and Vetvicka, 1986).
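
The setting of sorting stringency mentioned in point (3) can be pictured, purely as an illustration, as choosing a fluorescence threshold (gate) so that a chosen fraction of the brightest beads is recovered. The Python sketch below is a simplified model and not a description of any sorter's software. The function names, the example signal values and the `keep_fraction` parameter are hypothetical.

```python
import math


def gate_threshold(signals, keep_fraction):
    """Return the fluorescence value at or above which roughly
    keep_fraction of the beads fall (at least one bead is always kept)."""
    if not 0 < keep_fraction <= 1:
        raise ValueError("keep_fraction must be in (0, 1]")
    ranked = sorted(signals, reverse=True)
    n_keep = max(1, math.floor(len(ranked) * keep_fraction))
    return ranked[n_keep - 1]


def sort_beads(signals, threshold):
    """Keep the beads whose signal reaches the gate."""
    return [s for s in signals if s >= threshold]


# Hypothetical per-bead fluorescence values (arbitrary units).
signals = [12, 15, 9, 2500, 14, 11, 900, 13, 10, 16]
stringent = gate_threshold(signals, 0.1)  # recover only the brightest 10%
relaxed = gate_threshold(signals, 0.2)    # recover the brightest 20%
recovered = sort_beads(signals, relaxed)
```

A stringent gate recovers fewer, brighter beads, while a relaxed gate recovers more beads at the cost of lower enrichment per round.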

(vii) MULTI-STEP PROCEDURE 

It will also be appreciated that according to the present invention, it is not necessary for
all the processes of transcription/replication and/or translation, and selection to proceed in 
one single step, with all reactions taking place in one microcapsule. The selection 
procedure may comprise two or more steps. First, transcription/replication and/or 
translation of each genetic element of a genetic element library may take place in a first
microcapsule. Each gene product is then linked to the genetic element which encoded it
(which resides in the same microcapsule), for example via a gene product-specific ligand 
such as an antibody. The microcapsules are then broken, and the genetic elements 
attached to their respective gene products optionally purified. Alternatively, genetic 
elements can be attached to their respective gene products using methods which do not
rely on encapsulation, for example phage display (Smith, G.P., 1985), polysome display
(Mattheakis et al., 1994), RNA-peptide fusion (Roberts and Szostak, 1997) or lac
repressor peptide fusion (Cull et al., 1992).

In the second step of the procedure, each purified genetic element attached to its gene 
product is put into a second microcapsule containing components of the reaction to be
selected. This reaction is then initiated. After completion of the reactions, the 
microcapsules are again broken and the modified genetic elements are selected. In the 
case of complicated multistep reactions in which many individual components and 
reaction steps are involved, one or more intervening steps may be performed between the 
initial step of creation and linking of gene product to genetic element, and the final step of
generating the selectable change in the genetic element.

If necessary, release of the gene product from the genetic element within a secondary
microcapsule can be achieved in a variety of ways, including by specific competition by a 
low-molecular weight product for the binding site or cleavage of a linker region joining 
the binding domain of the gene product from the catalytic domain either enzymatically 
(using specific proteases) or autocatalytically (using an intein domain).

(viii) SELECTION BY ACTIVATION OF REPORTER GENE EXPRESSION IN SITU 

The system can be configured such that the desired binding, catalytic or regulatory activity 
encoded by a genetic element leads, directly or indirectly, to the activation of expression of
a "reporter gene" that is present in all microcapsules. Only gene products with the desired 
activity activate expression of the reporter gene. The activity resulting from reporter gene 
expression allows the selection of the genetic element (or of the compartment containing 
it) by any of the methods described herein. 


For example, activation of the reporter gene may be the result of a binding activity of the 
gene product in a manner analogous to the "two hybrid system" (Fields and Song, 1989). 
Activation can also result from the product of a reaction catalysed by a desirable gene 
product. For example, the reaction product can be a transcriptional inducer of the reporter 
gene. For example arabinose may be used to induce transcription from the araBAD
promoter. The activity of the desirable gene product can also result in the modification of 
a transcription factor, resulting in expression of the reporter gene. For example, if the 
desired gene product is a kinase or phosphatase the phosphorylation or dephosphorylation 
of a transcription factor may lead to activation of reporter gene expression. 


(ix) AMPLIFICATION 

According to a further aspect of the present invention the method comprises the further 
step of amplifying the genetic elements. Selective amplification may be used as a means 
to enrich for genetic elements encoding the desired gene product.

In all the above configurations, genetic material comprised in the genetic elements may be
amplified and the process repeated in iterative steps. Amplification may be by the
polymerase chain reaction (Saiki et al., 1988) or by using one of a variety of other gene
amplification techniques including: Qβ replicase amplification (Cahill, Foster and Mahan,
1991; Chetverin and Spirin, 1995; Katanaev, Kurnasov and Spirin, 1995); the ligase chain
reaction (LCR) (Landegren et al., 1988; Barany, 1991); the self-sustained sequence 
replication system (Fahy, Kwoh and Gingeras, 1991) and strand displacement 
amplification (Walker et al., 1992). 
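
The effect of repeating sorting and amplification in iterative steps can be illustrated with a simple enrichment model. The Python sketch below is an assumption-laden illustration and not part of the disclosed method. It assumes that each round retains active genetic elements with probability `recovery` and inactive ones with probability `false_positive`, and that amplification preserves the proportions of the sorted pool. All parameter names and values are hypothetical.

```python
def enrich(fraction_active, recovery=0.9, false_positive=0.01):
    """Fraction of active genetic elements after one round of sorting.

    recovery and false_positive are assumed sorter characteristics,
    not figures taken from the text.
    """
    kept_active = fraction_active * recovery
    kept_inactive = (1.0 - fraction_active) * false_positive
    return kept_active / (kept_active + kept_inactive)


def rounds_needed(start, target, recovery=0.9, false_positive=0.01,
                  max_rounds=100):
    """Number of sort-and-amplify rounds until the active fraction
    reaches target, under the model above."""
    if recovery <= false_positive:
        raise ValueError("sorting must favour active elements")
    fraction, rounds = start, 0
    while fraction < target:
        if rounds >= max_rounds:
            raise RuntimeError("target not reached within max_rounds")
        fraction = enrich(fraction, recovery, false_positive)
        rounds += 1
    return rounds


# Hypothetical library with one active gene per million.
n_rounds = rounds_needed(1e-6, 0.5)
```

Under these assumed values each round improves the odds of an element being active by a factor of `recovery / false_positive`, so a rare active gene can come to dominate the pool after a small number of rounds.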

Various aspects and embodiments of the present invention are illustrated in the following
examples. It will be appreciated that modification of detail may be made without 
departing from the scope of the invention. 

All documents mentioned in the text are incorporated by reference. 


EXAMPLES 
Example 1. 

Enzymes can be expressed from genes in solution and genes attached to 
paramagnetic microbeads with identical efficiency.

One format for the selection of genetic elements by using a change in their optical 
properties is one in which the genetic element comprises a microbead to which the gene is 
attached. Here it is shown how a gene for an enzyme (E. coli dihydrofolate reductase) can
be linked to a paramagnetic bead and is translated in vitro just as efficiently as in solution.

The E. coli folA gene encoding dihydrofolate reductase (DHFR) is PCR-amplified using
oligonucleotides EDHFRFo and EDHFRBa. This DNA is then cloned into the pGEM-4Z
vector (Promega) digested with HindIII and KpnI downstream of the lac promoter and the
T7 RNA polymerase promoter. The oligonucleotide EDHFRBa appends the efficient
phage T7 gene 10 translational start site upstream of the DHFR start codon.
DNA sequencing identifies a clone which has the correct nucleotide sequence. Bacteria
transformed with this clone (pGEM-folA) are found to over-express active DHFR (driven 
from the lac promoter) when induced with IPTG. 

The folA gene in pGEM-folA plasmid is then PCR-amplified using primers folA-FW and
folA-BW, the resulting DNA fragment is HindIII and XhoI digested and subcloned into
HindIII/XhoI-digested pET23a expression vector (Novagen) to give construct
pET23a/folA. The sequence of PCR-amplified folA gene was verified by sequencing. 

pET23a/folA was further amplified with 5'-biotinylated primers pETfor.b and pETrev.b
and radio-labelled by including 10 µCi α35S-dATP (Amersham Pharmacia Biotech, U.K.)
in the PCR mix. The resulting 1765 bp double biotinylated fragment T7-folA was gel
purified using a Qiagen kit and quantified spectrophotometrically. The specific activity of
the product was 210000 CPM/pmol T7-folA DNA, as measured on the Beckman
LS6000SC scintillation counter. 10 nM and 1 nM dilutions of this DNA were made (in 1
mg/ml HindIII digested lambda DNA to eliminate non-specific binding to the plastic).
This PCR fragment was used thereafter to program a prokaryotic in vitro coupled
transcription/translation system designed for linear templates (Lesley, Brow and Burgess,
1991). A commercial preparation of this system is used (E. coli S30 Extract System for
Linear Templates; Promega) supplemented with T7 RNA polymerase (10*^ units).

The DNA fragment is bound to streptavidin-paramagnetic beads (0.74 µm diameter Sera-Mag
beads, biotin-binding capacity 46 nmol/mg, Seradyn, USA), partially precoated with
biotinylated protein A (Sigma). 2 µl of 80 µM biotinylated protein A is added to 100 µl (1
mg) beads, allowed to bind at room temperature for 1 hour, washed once and coated for
one hour at room temperature with rabbit IgG (10 µl 1 mg/ml antibody per 1 mg beads in
TBS/0.1% Tween-20 (TBST)). Beads were thereafter washed twice with TBST before
radiolabeled biotinylated T7-folA DNA was added and allowed to bind for 1 hour at room
temperature. The amount of bound T7-folA DNA was calculated by counting the
radioactivity bound to an aliquot of beads. ~50% of the total DNA was bound.
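
The calculation of bound DNA from scintillation counts is simple arithmetic, sketched below in Python for illustration. Only the specific activity (210000 CPM/pmol) is taken from the text. The function names and the example count values are hypothetical, and the sketch ignores background counts and radioactive decay.

```python
# Specific activity of the labelled T7-folA fragment, as stated in the text.
SPECIFIC_ACTIVITY_CPM_PER_PMOL = 210_000


def pmol_from_cpm(cpm, specific_activity=SPECIFIC_ACTIVITY_CPM_PER_PMOL):
    """Convert counts per minute into pmol of labelled DNA."""
    return cpm / specific_activity


def fraction_bound(bead_cpm, input_cpm):
    """Fraction of the input DNA recovered on the beads."""
    if input_cpm <= 0:
        raise ValueError("input_cpm must be positive")
    return bead_cpm / input_cpm


# Hypothetical counts: 210000 CPM of DNA added, 105000 CPM found on the beads.
bound_pmol = pmol_from_cpm(105_000)
bound_fraction = fraction_bound(105_000, 210_000)
```

With these illustrative counts, half of the input DNA (0.5 pmol of 1 pmol) would be bead-bound.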

DNA fragments bound on beads or unbound DNA fragment are added directly to the S30
Extract System. Reactions are incubated for 2 hours at 37 °C. 



Dihydrofolate reductase activity is assayed by spectrophotometrically monitoring the
oxidation of NADPH to NADP at 340 nm over a 10 minute time course as described by
(Williams et al., 1979; Ma et al., 1993). 2 µl of each quenched in vitro translation
reaction is added to 150 µl Buffer A (100 mM Imidazole, pH 7.0, 10 mM
β-mercaptoethanol) and 20 µl 1 mM NADPH. 20 µl dihydrofolate (1 mM) (H2F) is added
after 1 minute and the reaction monitored at 340 nm using a ThermoMax microplate
reader (Molecular Devices). Activity is calculated by initial velocities under S0 >> KM
conditions (vmax).
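
An initial velocity of the kind referred to above can be obtained as the slope of the
early, linear part of the A340 trace. The sketch below assumes a simple least-squares fit
over readings chosen by the user; the function name and the example readings are
hypothetical, and no extinction coefficient or path length is applied, so the result is
expressed in absorbance units per minute.

```python
def initial_velocity(times_min, absorbances):
    """Least-squares slope of absorbance versus time (A340 units per minute).

    Only readings from the linear phase of the reaction should be passed in.
    A negative slope corresponds to NADPH consumption.
    """
    n = len(times_min)
    if n != len(absorbances) or n < 2:
        raise ValueError("need at least two paired readings")
    mean_t = sum(times_min) / n
    mean_a = sum(absorbances) / n
    sxx = sum((t - mean_t) ** 2 for t in times_min)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (a - mean_a)
              for t, a in zip(times_min, absorbances))
    return sxy / sxx


if __name__ == "__main__":
    # Hypothetical readings taken once per minute.
    print(initial_velocity([0, 1, 2, 3], [1.00, 0.90, 0.80, 0.70]))
```

Comparing the slopes obtained for reactions programmed with free and bead-bound DNA
gives the relative DHFR activities without needing an absolute calibration.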

There is no significant difference in the amount of active DHFR produced if the DNA is 
free, or attached via terminal biotins to a streptavidin coated bead (see Figure 1). 


Example 2. 

A fluorescent protein (GFP) can be translated in vitro from genes attached to single
microbeads encapsulated in the aqueous compartments of a water-in-oil emulsion
and the translated gene-product bound back to the microbeads making them
fluorescent.

One format for the selection of genetic elements is where the genetic element comprises a
gene linked to a microbead and the product is coupled back onto the microbead within the
microcapsule resulting directly, or indirectly, in a change in the optical properties of the
microbead which allows it to be sorted.

Here it is shown that a fluorescent protein (green fluorescent protein or GFP) can be
transcribed and translated in vitro from genes attached to single microbeads encapsulated
in the aqueous compartments of a water-in-oil emulsion and the translated gene-product
bound back to the microbeads making them fluorescent.

The GFP in pBS/GFP6 plasmid (Siemering et al., 1996) was PCR-amplified using
primers GFP-FW and GFP-BW, the resulting DNA fragment digested with HindIII and XhoI
and subcloned into HindIII/XhoI-digested pET23a expression vector (Novagen) to give
construct pET23a/GFP. The sequence of PCR-amplified GFP gene was verified by
sequencing. pET23a/GFP was further amplified with 5'-biotinylated primers pETfor.b and
pETrev.b. The resulting 2038 bp double biotinylated fragment T7-GFP was gel purified
using a Qiagen kit and quantified spectrophotometrically. 10 nM and 1 nM dilutions of
this DNA were made in 1 mg/ml HindIII-digested lambda DNA (to eliminate non-specific
binding to the plastic). This PCR fragment was used thereafter to program a prokaryotic
in vitro coupled transcription/translation system designed for linear templates (Lesley,
Brow and Burgess, 1991). A commercial preparation of this system is used (E. coli S30
Extract System for Linear Templates; Promega) supplemented with T7 RNA polymerase
(10^ units).

As a control, a biotinylated 1765 bp DNA fragment T7-folA (synthesised by PCR as in
Example 1) was used to program the synthesis of the non-fluorescent protein DHFR.






150 µl ProActive streptavidin-coated paramagnetic beads (Bangs Laboratories,
2x10^ beads/µl) were suspended in 5 mM Tris 7.4/1 M NaCl/0.1% Tween20 and split into
three aliquots of 50 µl. 0.5 µl of 0.2 µM DNA (T7-folA or T7-GFP) was added to each
aliquot of beads, incubated at 43 °C for 15 min, washed three times in 25 mM NaH2PO4,
125 mM NaCl, 0.1% Tween20, pH 7.0 (PBS/0.1% Tween20), resuspended in 40 µl TBST
and 10 µl 80 µM biotinylated protein A (Sigma) was added (to give final concentration of
15 µM). After incubation for 30 minutes at room temperature, the beads were washed
three times in PBS/0.1% Tween20 and resuspended in 20 µl 1:10 dilution rabbit anti-GFP
polyclonal antibody (Clontech) or 1 mg/ml unimmunised rabbit IgG (Sigma). After
incubation for 30 minutes at room temperature, the beads were washed three times in
PBS/0.1% Tween20 and resuspended in 15 µl of S30 premix from an E. coli S30 Extract
System for Linear Templates (Promega), sonicated for one minute in a sonication bath,
then the rest of the S30 in vitro translation mixture was added (on ice) and supplemented
with T7 RNA polymerase (10^ units).

The 50 µl ice-cooled in vitro translation reactions were added gradually (in 5 aliquots of 10
µl over ~2 minutes) to 0.95 ml of ice-cooled oil-phase (freshly prepared by dissolving 4.5%
(v/v) Span 80 (Fluka) in mineral oil (Sigma, #M-5904) followed by 0.5% (v/v) Tween 80
(SigmaUltra; #P-8074) in a 5 ml Costar Biofreeze Vial (#2051)) whilst stirring with a
magnetic bar (8x3 mm with a pivot ring; Scientific Industries International, Loughborough,
UK). Stirring (at 1150 rpm) was continued for an additional 3 minutes on ice. Reactions
were then incubated 3 h at 32 °C.

2 µl of emulsion were spread on a microscope slide beneath a 13 mm round cover slip and
visualised using a 20x Neofluar objective on an Axioplan microscope (Zeiss) equipped
with an RTEA CCD-1300-Y CCD camera (Princeton Instruments). Standard excitation
and emission filters for fluorescein were used and images were processed with IPLab
software.

As can be seen from Figure 2, the GFP translated from genes attached to single
microbeads encapsulated in the aqueous compartments of the emulsions is bound to the
microbeads in situ when the microbeads are coated with an anti-GFP antibody. This
binding is observed as concentration of fluorescence on the beads by epifluorescence
microscopy. No bead fluorescence is observed when either the GFP gene or the anti-GFP
antibody is missing.

Example 3. 

A fluorescent protein (GFP) can be translated in vitro from genes attached to single
microbeads encapsulated in the aqueous compartments of a water-in-oil emulsion,
the translated gene-product bound back to the microbeads and the increased
fluorescence of the microbeads detected by flow cytometry.

150 µl streptavidin-coated polystyrene beads (diameter 1 µm; Bangs Laboratories, 2x10^
beads/µl) were suspended in 5 mM Tris 7.4/1 M NaCl/0.1% Tween20 and split into three
aliquots of 50 µl. 0.5 µl of 0.2 µM DNA (T7-folA or T7-GFP) was added to each aliquot
of beads, incubated at 43 °C for 15 min, washed three times in 25 mM NaH2PO4, 125 mM
NaCl, 0.1% Tween20, pH 7.0 (PBS/0.1% Tween20), resuspended in 40 µl TBST and 10
µl 80 µM biotinylated protein A (Sigma) was added (to give final concentration of 15
µM). After incubation for 30 minutes at room temperature, the beads were washed three
times in PBS/0.1% Tween20 and resuspended in 20 µl 1:10 dilution rabbit anti-GFP
polyclonal antibody (Clontech) or 1 mg/ml unimmunised rabbit IgG (Sigma). After
incubation for 30 minutes at room temperature, the beads were washed three times in
PBS/0.1% Tween20 and resuspended in 15 µl of S30 premix from an E. coli S30 Extract
System for Linear Templates (Promega), sonicated for one minute in a sonication bath,
then the rest of the S30 in vitro translation mixture was added (on ice) and supplemented
with T7 RNA polymerase (10^ units). The 50 µl ice-cooled in vitro translation reactions
were added gradually (in 5 aliquots of 10 µl over ~2 minutes) to 0.95 ml of ice-cooled
oil-phase (freshly prepared by dissolving 4.5% (v/v) Span 80 (Fluka) in mineral oil (Sigma,
#M-5904) followed by 0.5% (v/v) Tween 80 (SigmaUltra; #P-8074) in a 5 ml Costar
Biofreeze Vial (#2051)) whilst stirring with a magnetic bar (8x3 mm with a pivot ring;
Scientific Industries International, Loughborough, UK). Stirring (at 1150 rpm) was
continued for an additional 3 minutes on ice. Reactions were then incubated 3 h at
32 °C. To recover the reaction mixtures, the emulsions were spun at 3,000 g for 5 minutes
and the oil phase removed leaving the concentrated (but still intact) emulsion at the
bottom of the vial. PBS and 2 ml of water-saturated ether were added and the mixture was
vortexed, centrifuged briefly, and the ether phase removed. Beads were washed twice
with PBS and finally resuspended at 10* beads/ml in PBS. 10** beads were analysed using
a FACScalibur flow cytometer (Becton Dickinson) using excitation at 488 nm and the
fluorescein emission filter. The GFP translated from genes attached to single microbeads
encapsulated in the aqueous compartments of the emulsions is bound to the microbeads in
situ when the microbeads are coated with an anti-GFP antibody. The binding of GFP to
the microbeads makes them fluorescent (Fig. 2), and those beads with GFP bound can be
clearly distinguished from those without by flow cytometry (Figure 3).

Example 4 

The product of an enzyme catalysed reaction can be captured on paramagnetic
beads and beads derivatised with product identified by flow cytometry.

A reaction catalysed by the enzyme human glutathione S-transferase M2-2 (GST M2-2)
was performed to generate a biotinylated product (Figure 4). The two substrates used were
1-chloro-2,4-dinitrobenzene (CDNB; Sigma) and reduced biotinylated-glutathione
(Biotin-GSH). The product generated (Biotin-GS-DNP) has biotin at one end to enable
coupling to streptavidin-coated paramagnetic microparticles and a 2,4-dinitrophenol
(DNP) group which can be bound by an anti-DNP antibody.

Biotin-GSH was synthesised by adding 100 mg biotinamidocaproate
N-hydroxysuccinimide ester (biotin-NHS; Sigma) in 1 ml DMF to a solution of oxidised
glutathione (Fluka) in 1 ml water, 30 µl 12.5 N NaOH plus 1 ml DMF. The biotin-NHS
was added, on ice, in 100 µl aliquots over 20 minutes. The pH was then adjusted to 7.0
with 1 N NaOH. The syrup-like precipitate which formed during the reaction was
dissolved by warming to room temperature, vortexing and adding 300 µl water. Stirring
was continued for 2 hours at room temperature, the pH brought back to 7.0 by adding 1 N
NaOH and stirred overnight at room temperature. NaOH was then used to bring the pH
back to 7.5, the reaction stirred a further 30 minutes at room temperature and then
incubated 30 minutes more after adding 500 µl 1 M DTT. The solvents were evaporated
under vacuum and the product purified by reverse-phase HPLC using a C8 column and a
gradient of 10-40% Acetonitrile, 0.1% TFA.

Biotin-GS-DNP was synthesised enzymatically in a 100 µl reaction containing 1 µg
purified recombinant GST M2-2, 500 µM CDNB and 200 µM Biotin-GSH in 0.1 M KH2PO4,
1 mM EDTA, pH 6.5. Incubation was for 1 hour at 25 °C. The reaction went essentially
to completion as judged by following the increase in absorbance at 340 nm. Control
reactions were also performed 1) with no GST, 2) with no CDNB, and 3) with no
biotin-GSH. Reactions were diluted 200 times (giving a final concentration of 1 µM biotin)
into 5 mM Tris-HCl, 0.5 mM EDTA, 1.0 M NaCl, pH 7.4 (B/W buffer). 50 µl of the diluted
reactions were mixed with 50 µl B/W buffer containing 29.3 µg (10^ microparticles)
0.737 µm diameter Sera-Mag(TM) streptavidin-coated magnetic microparticles (MG-SA;
Seradyn) and incubated 1 hour at room temperature. Microparticles were separated in a
microtitre plate (Falcon 3911) using a magnet (Dynal MPC-96) and washed three times
with 10 mM Tris-HCl, 1 mM EDTA, 2.0 M NaCl, pH 7.4 (2xB/W buffer), then twice with
PBS, 0.1% Tween 20. The microparticles were resuspended in a 1:2500 dilution of the
mouse anti-dinitrophenol monoclonal antibody SPE 21-11 (a gift from Prof. Zelig Eshhar)
in PBS/0.1% Tween 20 and incubated 45 minutes at room temperature. The microparticles
were washed three times in PBS/0.1% Tween 20, resuspended in PBS/0.1% Tween 20
containing 15 µg/ml fluorescein (FITC)-conjugated F(ab')2 fragment goat anti-mouse IgG,
F(ab')2 fragment (Jackson; 115-096-006) and incubated 30 minutes at room temperature.
The microparticles were washed four times in PBS/0.1% Tween 20, resuspended in 1 ml
PBS/0.1% Tween 20 and 2 x 10^ microparticles analysed using a FACScan flow
cytometer (Becton Dickinson).

As can be seen from Figure 5, there is no difference in the distribution of fluorescence
intensity of beads from all three control reactions (no GST, no CDNB, and no
biotin-GSH), where mean fluorescence is ~3. In contrast, beads from the enzyme catalysed
reaction have a mean fluorescence of 34, over 10 times higher. Indeed, using the gate
shown (Fig. 5), 81.1% of beads from the enzyme catalysed reaction (and coated with the
biotinylated product) are in the gate whereas in the control reactions no more than
0.06% of beads are in the gate. Hence, beads coated with the product of the
GST catalysed reaction can easily be sorted from those which are not.
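
The comparison reported above (mean bead fluorescence and percentage of beads falling
within a gate) can be reproduced from a list of per-bead fluorescence values. The sketch
below is illustrative only: the function name, the gate limits and the intensity values
are hypothetical and are not taken from Figure 5 or from the cytometer software.

```python
def gate_statistics(intensities, gate_low, gate_high):
    """Return (mean intensity, percent of events inside the gate).

    An event is counted as inside the gate when
    gate_low <= intensity <= gate_high.
    """
    if not intensities:
        raise ValueError("no events supplied")
    if gate_low > gate_high:
        raise ValueError("gate_low must not exceed gate_high")
    in_gate = sum(1 for x in intensities if gate_low <= x <= gate_high)
    mean = sum(intensities) / len(intensities)
    return mean, 100.0 * in_gate / len(intensities)


if __name__ == "__main__":
    # Hypothetical per-bead fluorescence values (arbitrary units).
    control = [2, 3, 4, 3]
    reaction = [30, 40, 5, 35]
    for label, events in (("control", control), ("reaction", reaction)):
        mean, percent = gate_statistics(events, gate_low=10, gate_high=1000)
        print(label, mean, percent)
```

Applying the same gate to the control and enzyme-catalysed samples gives the two
percentages that are compared when judging whether the populations can be sorted.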


Example 5. 

Glutathione S-transferase M2-2 (GST M2-2) will use as a substrate caged-
biotinylated-glutathione and the caged-biotinylated product generated can
subsequently be uncaged by UV irradiation, captured on avidin-coated beads and
detected by flow cytometry.

The synthesis of caged-biotin (5) and its derivatives (7) was based on the published
protocols (Pirrung & Huang, 1996; Sundberg et al., 1995). However, significant
modifications of these protocols were made in several steps of the synthesis as described
below.

Biotin methyl ester (3, Biotin-OMe) was prepared essentially as described in Sundberg
et al. (1995) (see Fig. 6):

Methylnitropiperonyl alcohol (1, MeNPOH). 3',4'-(Methylenedioxy)-6'-
nitroacetophenone (Lancaster; 6.2 g, 29.6 mmol) was dissolved in a mixture of THF (100
ml) and ethanol (100 ml). Sodium borohydride (1.12 g, 29.6 mmol) was added and the
solution stirred for 3 hours at room temperature. TLC (on silica coated plates; solvent:
3% methanol in DCM) indicated the full conversion of the starting material (Rf = 0.8) to
the alcohol (Rf = 0.6). Hydrochloric acid (1 N) was added slowly until the evolution of
hydrogen stopped and the solvents evaporated under vacuum. The residual solid was
dissolved in DCM (500 ml) and washed with brine (40 ml). The organic phase was dried
(over MgSO4) and the solvent removed under vacuum. Recrystallisation from hot DCM
and hexane gave 6.1 g of 1 (a yellow crystalline solid).
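
The masses and molar amounts quoted for this reduction allow the isolated yield to be
estimated. The sketch below is an illustration rather than part of the described
procedure; the molar mass of alcohol 1 (211.17 g/mol, computed here from the formula
C9H9NO5) is an assumption that does not appear in the text, and the function name is
hypothetical.

```python
def percent_yield(product_mass_g, product_molar_mass_g_per_mol, limiting_mmol):
    """Isolated yield (%) relative to the limiting starting material."""
    if product_molar_mass_g_per_mol <= 0 or limiting_mmol <= 0:
        raise ValueError("molar mass and limiting amount must be positive")
    product_mmol = 1000.0 * product_mass_g / product_molar_mass_g_per_mol
    return 100.0 * product_mmol / limiting_mmol


if __name__ == "__main__":
    # 6.1 g of alcohol 1 from 29.6 mmol of ketone (values from the text);
    # 211.17 g/mol is an assumed molar mass for C9H9NO5.
    print(round(percent_yield(6.1, 211.17, 29.6), 1))
```

Under that assumption the reduction is close to quantitative, which is consistent with
the full conversion indicated by TLC.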

O-Methylnitropiperonyl-carbonylimidazole (2, MeNPO-CO-Im).

Methylnitropiperonyl alcohol (1.69 g, 8 mmol) was added (in several portions during 20
minutes) to a solution of carbonyldiimidazole (CDI, 2.6 g, 16 mmol) in DCM (50 ml).
The solution was stirred for 3 hrs after which TLC indicated the complete conversion of
the alcohol (Rf = 0.6; 3% methanol in DCM) into product (Rf = 0.45). DCM (100 ml)
and water (30 ml) were added and the reaction mixture transferred to a separatory funnel.
The mixture was mixed and 1 N HCl was added (in 1 ml aliquots) until the pH of the
aqueous phase went below 6. The aqueous phase was removed, more water added (30 ml)
and acidified to pH 6 while mixed. Finally, the DCM phase was washed with brine, dried
(over MgSO4) and the solvent removed under vacuum. The remnant solid was
recrystallised from hot DCM and hexane to give 2.2 g of 2 (a yellow crystalline solid).


N-(O-Methylnitropiperonyl-carbonyl)-Biotin methyl ester (4, MeNPO-CO-Biotin-
OMe). Sodium hydride (60% suspension in oil; 100 mg, 2.5 mmol) was added to a
stirred suspension of Biotin-OMe (517 mg, 2 mmol) and MeNPO-CO-Im (305 mg, 1
mmol) in anhydrous DCM (10 ml) on ice. The solution was stirred for 30 minutes on ice
and 30 minutes at room temperature. TLC indicated the complete disappearance of the
MeNPO-CO-Im (Rf = 0.6; 5% methanol in DCM) and the appearance of the product
(Rf = 0.45). Traces of alcohol 1 (Rf = 0.7), and a side-product with Rf = 0.95 (probably
di-MeNPO-carbonate) were also observed. (The ratio of product vs. the above side-product
varied from one preparation to another; careful drying of the starting materials and
performing the reaction on ice gave generally higher yields of the product.)

Once the reaction had been completed, DCM was added (100 ml) and the solution
extracted three times with 1 M NaH2PO4. The organic phase was dried (MgSO4) and the
solvent removed under vacuum. The remnant syrup was dissolved in hot DCM (ca. 5 ml),
hexane (ca. 5 ml) was added to the cloud-point and the solution was allowed to stand at
4 °C overnight. This resulted in the precipitation of the excess of the Biotin-OMe as a
white crystalline solid (which was washed with ether, dried and used in subsequent
reactions). The filtrate was concentrated in vacuum and purified by chromatography on
silica (1.5 to 3% methanol in DCM) to give 4 as a yellow foam (with yields up to 385 mg,
or 80% based on molar equivalents of 2 as starting material).


N-(O-Methylnitropiperonyl-carbonyl)-Biotin (5, MeNPO-CO-Biotin-OH).

MeNPO-CO-Biotin-OMe (940 mg; 1.73 mmol) was dissolved in 25 ml of 0.5 N HCl and
dioxane (4:6; flushed with argon). The solution was stirred at 44 °C for 24 hours under
argon. The solvents were reduced under vacuum to ca. 1 ml, water was added (10 ml) and
the resulting mixture lyophilised. The resulting solid was dissolved in DCM with 2%
methanol (20 ml) and charcoal was added. The mixture was boiled for a few minutes and
filtered. TLC (10% methanol in DCM) indicated the appearance of the product of the
hydrolysis (Rf = 0.2) and about 5% of starting material (MeNPO-CO-Biotin-OMe; Rf =
0.9). The solvents were removed under vacuum to give a yellow solid that was dried under
vacuum (860 mg of ca. 95% of 5 plus 5% of 4). Higher concentrations of HCl (e.g., 1 N)
and higher temperatures (e.g., reflux with THF as a co-solvent) resulted in complete
hydrolysis of the methyl ester. However, significant amounts of alcohol 1 and biotin were
also observed, indicating the hydrolysis of the carbamate under these conditions. It should
also be noted that methyl ester 4, and in particular, the product of its hydrolysis (5) were
found to be sensitive to oxidation. Warming or even storing solutions of 5 in the presence
of air resulted in browning. Similarly, attempts to purify 5 (or derivatives of it, e.g., 7) by
chromatography on silica led to very high losses due to oxidation.

N-(N-(O-Methylnitropiperonyl-carbonyl)-Biotin)-3-aminopropionic acid tert-butyl
ester (6, MeNPO-CO-Biotin-β-Ala-OBut). MeNPO-CO-Biotin-OH (860 mg containing
~5% of MeNPO-CO-Biotin-OMe; ~1.6 mmol) was dissolved in 20 ml of anhydrous
DCM. β-Alanine tert-butyl ester (H-β-Ala-OBut) hydrochloride salt (Bachem; 362 mg; 2
mmol), N-hydroxysuccinimide (172 mg; 1.5 mmol) and triethylamine (280 µl; 2 mmol)
were added. The stirred solution was cooled on ice and EDCI was added (420 mg; 2.2
mmol). The reaction was stirred for 24 hours at 4 °C and 2 hours at room temperature.
TLC (5% methanol in DCM) indicated the appearance of the product (Rf = 0.3) and the
remaining, unreacted MeNPO-CO-Biotin-OMe (Rf = 0.45). The reaction was diluted with
DCM (30 ml) and extracted three times with 1 M NaH2PO4 and once with saturated
NaHCO3. The organic phase was dried (Na2SO4) and the solvent removed under vacuum.
The remnant syrup was purified by chromatography on silica (3.0-4.5% methanol in
DCM) to give 640 mg of 6 (a yellow foam).


N-(N-(O-Methylnitropiperonyl-carbonyl)-Biotin)-3-aminopropionic acid (7,
MeNPO-CO-Biotin-β-Ala-OH). Tert-butyl ester 6 (510 mg; 0.84 mmol) was dissolved
in 15 ml of 0.5 N HCl and dioxane (4:6; flushed with argon). The solution was stirred at
52 °C for 24 hours under argon. Water was added (10 ml) and the resulting solution was
freeze-dried to give a solid that contained (as judged by TLC) the product of the
hydrolysis (7) and starting material (6; ~10%). This mixture was purified by column
chromatography on silica (10% methanol in acetone plus 0.1% acetic acid) to give 60 mg
of 7 (the low yields were primarily the result of oxidation of 7 on the silica).

N-(N-(N-(O-Methylnitropiperonyl-carbonyl)-Biotin)-3-aminopropionyl)-glutathione
(8, MeNPO-CO-Biotin-β-Ala-GSH). Carbonyldiimidazole (20 mg, 120 µmol) was
added to a solution of MeNPO-CO-Biotin-β-Ala-OH (7, 49 mg, 89 µmol) in DMF (1.5
ml). The solution was stirred for 30 minutes at room temperature and was then added, in
several aliquots, to a solution of oxidised glutathione (62 mg, 100 µmol) and
triethylamine (55 µl, 0.4 mmol), in DMF (2 ml) plus water (0.15 ml), stirred on ice. The
solution was stirred on ice for 30 minutes and then at room temperature. Triethylamine
was added, until the solution became clear (25 µl), and the reaction was then stirred for
another 2 hours at room temperature. DTT was then added (0.25 ml of 1 M solution; 0.25
mmol), and the solution was stirred at room temperature for 10 minutes.

The product of the above reaction was purified by reverse-phase HPLC, on an RP-8
preparative column, using a water-acetonitrile gradient in the presence of 0.1%
trifluoroacetic acid. The peak corresponding to 8 (retention time = 28.6 minutes) was
collected. The product was then isolated by freeze-drying and purified again on
reverse-phase HPLC (using the same column and solvent system). Analysis of the product
after the second HPLC purification, using analytical reverse-phase HPLC, indicated a
product (>95%) the UV spectrum of which corresponded to 8 (specifically, absorbance at
355 nm indicated the presence of the O-methylnitropiperonyl-carbonyl group of the
caged-biotin). The concentration of 8 was determined by titrating the free thiol groups
(using DTNB, 5,5'-dithiobis(2-nitrobenzoic acid), as in Hermanson, 1996) derived from
the glutathione, and also by absorbance at 355 nm (corresponding to the caged-biotin).
Both these independent measurements gave the same result within experimental error.
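
A thiol titration with DTNB of the kind mentioned above is normally converted into a
concentration with the Beer-Lambert law. The sketch below is illustrative only: the
extinction coefficient of the released TNB anion (14150 per M per cm at 412 nm) is a
commonly quoted literature value and is an assumption here, as are the function name,
the 1 cm path length and the example absorbance; none of these are stated in the text.

```python
# Assumed extinction coefficient of the TNB anion at 412 nm (per M per cm).
TNB_EXTINCTION_412 = 14_150.0


def thiol_concentration_molar(a412, path_length_cm=1.0, dilution_factor=1.0,
                              extinction=TNB_EXTINCTION_412):
    """Free thiol concentration (mol/l) in the undiluted sample.

    a412            -- blank-corrected absorbance at 412 nm
    path_length_cm  -- optical path length of the cuvette
    dilution_factor -- fold dilution of the sample in the assay (>= 1)
    """
    if path_length_cm <= 0 or extinction <= 0:
        raise ValueError("path length and extinction must be positive")
    if dilution_factor < 1:
        raise ValueError("dilution_factor must be at least 1")
    return a412 / (extinction * path_length_cm) * dilution_factor


if __name__ == "__main__":
    # Hypothetical reading: A412 = 0.1415 after a 10-fold dilution.
    print(thiol_concentration_molar(0.1415, dilution_factor=10.0))
```

Since each molecule of reduced 8 carries one thiol, the thiol concentration obtained in
this way can be compared directly with the value derived from absorbance at 355 nm.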


The purified 8 was also found to be a substrate for human M2-2 GST in the electrophilic 
substitution of CDNB (monitored by the change of absorbance at 340 nm; Habig & 
Jakoby, 1981) with rates that are about 10 fold slower than those observed with 
glutathione under similar conditions. 


The reduced MeNPO-CO-Biotin-β-Ala-GSH (caged-biotin-βala-GSH) was reacted with
either 1-chloro-2,4-dinitrobenzene (CDNB; Sigma) or 4-chloro-3-nitrobenzoate (CNB,
Acros). The caged product generated does not bind avidin or streptavidin. However, after
photochemical uncaging by ultraviolet radiation the product has a biotin at one end which
will bind to avidin or streptavidin-coated microparticles and either a 2,4-dinitrophenol
(DNP) or a 3-nitrobenzoate group which can be bound by appropriate anti-DNP or anti-3-
nitrobenzoate antibodies (see Figs. 7 & 8).

5 µl (10^ beads) 1.0 µm diameter nonfluorescent neutravidin labelled microspheres
(Molecular Probes, F-8777) were spun in a microfuge at 10,000 g for 3 min and the
supernatant removed. The beads were resuspended in 5 µl 0.1 M KH2PO4, pH 6.5, 1 mM
EDTA, 2 mM dithiothreitol, 10 µM caged-biotin-βala-GSH, and either 500 µM CDNB or
500 µM CNB. The 5 µl reaction mixes contained either 0.75 µg purified recombinant
human GST M2-2 or no enzyme.


Reactions were incubated for 30 min (CDNB reactions) or 4 hours (CNB reactions) at
25 °C, after which time they were stopped by the addition of 35 µl 0.1 M sodium acetate,
pH 5.0 and transferred to ice. Each reaction was then split into two aliquots of 20 µl each,
one of which was placed as a spot on a layer of parafilm on the surface of an ice-cooled
aluminium block. This spot was then irradiated for 2 min with a B 100 AP UV lamp
(UVP) held at a distance of 6 cm. The other aliquot was left un-irradiated. All samples
were then incubated 30 min at ambient temperature and then washed three times with
200 µl PBS, 0.1% Tween 20 in a 0.45 µm MultiScreen-HV filter plate (Millipore,
MAHVN4510), thoroughly resuspending between each wash.

Beads were then resuspended in 200 µl PBS, 0.1% Tween 20 containing 20 ng/µl
Alexa-488 labelled rabbit anti-DNP antibody (Dako, #V0401) or 20 ng/µl Alexa-488
labelled anti-CNB antisera and incubated for 1 hour at room temperature. The anti-CNB
antiserum was elicited in rabbits by immunisation with CNB-CH2-KLH conjugates
prepared by adding aliquots of a 200 mM solution of 4-(bromomethyl)-3-nitrobenzoic acid
(CNB-CH2Br) in DMF to 5 mg/ml solutions of bovine serum albumin (BSA) or keyhole
limpet hemocyanin (KLH) in 50 mM borate pH 8.8 (to give 1.5 to 6 µmole of CNB-CH2Br
per mg protein). The reaction mixtures were stirred for 6 hours at room temperature,
and the resulting protein conjugates were dialysed extensively against
phosphate buffer saline (PBS) at 4 °C. The level of conjugation (hapten density or Hd)
was determined by measuring optical densities of the conjugates at 355 nm. These were
found to be: 7 to 11 CNB-CH2 groups per BSA molecule and 9.4 to 24.3 per KLH
molecule depending on the amount of CNB-CH2Br added to the protein samples. The
CNB-CH2-KLH conjugate with Hd of 14.2 was used to immunise rabbits using published
protocols (Tawfik et al., 1993; Tawfik et al., 1997) (by Prof. Z Eshhar, Weizmann
Institute of Science, Rehovot). Sera were tested by ELISA for binding to the conjugate
CNB-CH2-BSA (Hd = 11) and to BSA. The first bleed from both immunised rabbits (when
diluted 50 fold or more) exhibited the desirable selectivity, yielding high signal when
incubated with the CNB-CH2-BSA conjugate and very low background (<5%) with BSA.
The anti-CNB serum was purified using a HiTrap Protein A column (Pharmacia). Both
anti-CDNB and anti-CNB antibodies were labelled with an Alexa Fluor 488 protein
labelling kit (Molecular Probes) according to the manufacturer's instructions.

The beads were washed three times with 200 µl PBS, 0.1% Tween 20 as above, then
resuspended in 1 ml PBS, 0.1% Tween 20 and 10,000 events analysed using a FACScan
flow cytometer (Becton Dickinson).

As can be seen from Fig. 9, the caged-biotin moiety is uncaged on UV irradiation and
binds to beads. A 19-fold increase in mean bead fluorescence was observed after GST
M2-2 catalysed reaction of caged-biotin-βala-GSH with CDNB even in the absence of
UV irradiation. This correlates with the apparent presence of ~4% biotin-βala-GSH in the
preparation of caged-biotin-βala-GSH as determined by using fluorimetry to measure the
displacement of 2-anilinonaphthalene-6-sulphonic acid (2,6-ANS) from avidin (Mock et al.,
1985). These results are consistent with the previously observed background
immobilisation of caged-biotin to avidin 'in the dark' (i.e., without UV illumination)
which was as high as 15% of the signal observed after illumination (Sundberg et al.
1995). The 'dark' signal observed previously was ascribed to either trace contaminants of
biotin in the caged-biotin preparation, or to weak interactions between avidin and
components of the caged-biotin including the linker (Sundberg et al. 1995). After UV
irradiation a large difference in the mean fluorescence of those beads incubated in the
presence and absence of GST was observed. The mean bead fluorescence with GST was
84 times and 56 times that observed without GST with CDNB and CNB as substrates
respectively (Fig. 9).

Example 6. 

Glutathione S-transferase M2-2 (GST M2-2) compartmentalised in the aqueous droplets of
a water-in-oil emulsion catalyses the reaction of caged-biotinylated-glutathione with
4-chloro-3-nitrobenzoate (CNB). The caged-biotinylated product generated remains
compartmentalised and can subsequently be uncaged by UV irradiation in the
compartments, captured on an avidin-coated bead in the same compartment and the
product-coated beads detected by flow cytometry.

20 µl aliquots (4 x 10^ beads) of 1.0 µm diameter nonfluorescent neutravidin labelled
microspheres (Molecular Probes, F-8777) or 0.93 µm diameter streptavidin-coated
polystyrene beads (Bangs Laboratories) were each spun in a microfuge at 2,600 g (6,500
rpm) for 3 min. The supernatant was removed and the beads resuspended, on ice, in 20 µl
0.1 M KH2PO4, pH 6.5, 1 mM EDTA, 2 mM dithiothreitol, 50 µM caged-biotin-βala-
GSH, containing either 3 µg purified recombinant human GST M2-2 or no enzyme.

Six reaction mixtures were then emulsified essentially as Tawfik & Griffiths (1998):
a) Bangs beads, no GST
b) Bangs beads, plus GST
c) Molecular Probes beads, no GST
d) Molecular Probes beads, plus GST
e) Bangs beads, no GST
f) Molecular Probes beads, no GST

The oil phase was freshly prepared by dissolving 4.5% (v/v) Span 80 (Fluka) in mineral
oil (Sigma, #M-5904) followed by 0.5% (v/v) Tween 80 (SigmaUltra; #P-8074). Ice-
cooled reaction mixtures were added gradually (in 5 aliquots of 4 µl over ~2 minutes) to
0.4 ml of ice-cooled oil-phase in a 5 ml Biofreeze Vial (Costar, #2051) whilst stirring
with a magnetic bar (8x3 mm with a pivot ring; Scientific Industries International,
Loughborough, UK). Stirring (at 1150 rpm) was continued for an additional 1 minute on
ice.

8 µl of emulsion d) was added to 0.4 ml emulsion e), and 8 µl of emulsion b) was added
to 0.4 ml emulsion f) (to give 1:50 dilutions) and the emulsion mixtures vortexed for 5
seconds to mix.
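
The 1:50 figure quoted above is the ratio of the two volumes that are combined. A short
sketch of that arithmetic follows; the function names are hypothetical, and the
calculation assumes (as an idealisation) that both emulsions carry the same number of
beads per unit volume.

```python
def mixing_ratio(minor_volume_ul, major_volume_ul):
    """Return N for a 1:N mixture of a minor into a major component."""
    if minor_volume_ul <= 0 or major_volume_ul <= 0:
        raise ValueError("volumes must be positive")
    return major_volume_ul / minor_volume_ul


def minor_fraction(minor_volume_ul, major_volume_ul):
    """Fraction of the final mixture contributed by the minor component."""
    if minor_volume_ul <= 0 or major_volume_ul <= 0:
        raise ValueError("volumes must be positive")
    return minor_volume_ul / (minor_volume_ul + major_volume_ul)


if __name__ == "__main__":
    # 8 µl of the GST-containing emulsion added to 0.4 ml (400 µl) of the other.
    print(mixing_ratio(8, 400))
    print(minor_fraction(8, 400))
```

On this basis roughly one bead in fifty-one in each mixture originates from a
compartment that contained GST, which sets the size of the high-fluorescence population
expected if compartmentalisation is maintained.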

Six reaction mixtures were left non-emulsified:
a) Bangs beads, no GST
b) Bangs beads, plus GST
c) Molecular Probes beads, no GST
d) Molecular Probes beads, plus GST
e) Bangs beads, no GST
f) Molecular Probes beads, no GST

0.4 µl of d) was added to 20 µl of e), and 0.4 µl b) was added to 20 µl of f) (to give 1:50
dilutions).

Both emulsions and non-emulsified reactions were incubated for 15 min at 25 °C. Then
0.8 µl 500 mM CNB (in absolute ethanol) was added to each 0.4 ml emulsion and the
emulsion vortexed for 5 seconds (the CNB is transferred through the mineral oil to the
aqueous compartments). 5 µl 5 mM CNB (in 0.1 M KH2PO4, 1 mM EDTA, pH 6.5) was
added to the non-emulsified reactions.

All reactions were incubated for 4 hours at 25 °C.

The pH of the aqueous droplets was lowered to quench the GST catalysed reaction by
vortexing the emulsions with 200 µl Sigma Mineral Oil for Molecular Biology (M-5904)
containing 4.5% Span 80 (Fluka), 0.5% Tween 80 (Sigma Ultra) and 25 mM acetic acid.
The non-emulsified reactions were quenched by adding 25 µl 0.5 M acetic acid.

All reactions were transferred to a 24-well flat bottom plate (Corning, #25820) floating on
iced water and irradiated for 2 min with a B 100 AP UV lamp (UVP) held at a distance of
~6 cm. All samples were then incubated 30 min at ambient temperature.

The emulsions were transferred to 1.5 ml microfuge tubes, spun 1 min at 13.5k rpm in a
microfuge and the oil phase removed leaving the concentrated (but still intact) emulsion at
the bottom of the tube. 200 µl 0.1 M Na acetate, pH 5.0 were added and the emulsion
broken by extracting 4 times with 1 ml hexane, vortexing between each hexane addition.
Residual hexane was removed by spinning for 10 min at ambient temperature under
vacuum in a Speedvac (Farmingdale, NY).

All samples were then washed three times with 200 µl PBS, 0.1% Tween 20 in a 0.45 µm
MultiScreen-HV filter plate (Millipore, MAHVN4510), thoroughly resuspending between
each wash. Beads were then resuspended in 200 µl PBS, 0.1% Tween 20. 25 µl (~5 x 10^
beads) were then added to 200 µl PBS, 0.1% Tween 20 containing 20 ng/µl Alexa-488
labelled anti-DNP antibody or 20 ng/µl Alexa-488 labelled anti-CNB antibody (see
Example 5) and incubated for 1 hour at ambient temperature. The beads were washed
three times with 200 µl PBS, 0.1% Tween 20 as above, then resuspended in 1 ml PBS,
0.1% Tween 20 and 300,000 events analysed using a FACScan flow cytometer (Becton
Dickinson).

In the non-emulsified mixtures, where neither GST nor the product of the GST catalysed
reaction (caged-biotin-βAla-NB) were compartmentalised, all beads have a similarly low
fluorescence (Fig. 10, Panels B and D). In contrast, in the emulsion mixtures, where both
GST and the product of the GST catalysed reaction (caged-biotin-βAla-NB) were
compartmentalised, two populations of beads, one of low and one of higher fluorescence,
are clearly visible (Fig. 10, Panels C and E). Gating through R1 and R2 enables the Bangs
and Molecular Probes beads to be largely separated on the basis of their slightly different
light scattering characteristics (Fig. 10, Panel A). The ratio of Bangs to Molecular Probes
beads passing through R1 is 68%:0.1% and the ratio passing through R2 is 0.08%:87%.
Using these gates it is clear that the beads with high fluorescence are those which were
compartmentalised with the enzyme GST. Hence, compartmentalisation of beads, enzyme
and reaction product was obtained by emulsification, and those beads present in
compartments which contained enzymes can be distinguished from those which do not by
their fluorescence characteristics.
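
The gate ratios quoted above can be turned into a rough estimate of how cleanly each gate
separates the two bead types. The following Python sketch is illustrative only. It assumes
that the quoted percentages are the fractions of each bead type falling within a gate and
that both bead types were present in equal numbers (an assumption that is not stated in
the text); the function name is hypothetical.

```python
def gate_purity(target_fraction, other_fraction):
    """Fraction of events inside a gate that belong to the target bead type.

    Assumes equal numbers of both bead types were analysed (an assumption).
    """
    return target_fraction / (target_fraction + other_fraction)


# Percentages quoted in the text (Bangs : Molecular Probes)
r1_bangs, r1_molecular_probes = 68.0, 0.1
r2_bangs, r2_molecular_probes = 0.08, 87.0

r1_purity = gate_purity(r1_bangs, r1_molecular_probes)   # Bangs beads in R1
r2_purity = gate_purity(r2_molecular_probes, r2_bangs)   # Molecular Probes beads in R2

print(f"R1: {r1_purity:.2%} of gated events are Bangs beads")
print(f"R2: {r2_purity:.2%} of gated events are Molecular Probes beads")
```

Under these assumptions each gate would contain almost exclusively one bead type, which is
consistent with the beads being described as largely separated.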

Example 7. 

Human GST M2-2 can be transcribed and translated in vitro in the aqueous
compartments of a water-in-oil emulsion and catalyses a reaction which gives rise to
a change in the fluorescence properties of co-compartmentalised microspheres.

The gene encoding human glutathione S-transferase M2-2 (GST M2-2) is amplified by
PCR using oligonucleotides GSTM2-2Fo and GSTM2-2Bc from a human GST M2-2
cDNA clone in pGEM-3Z (Baez et al., 1997). The PCR fragment is cloned into the vector
pGEM-4Z (Promega) digested with HindIII and KpnI downstream of the lac promoter and
T7 RNA polymerase promoter. The oligonucleotide GSTM2-2Bc appends the efficient
phage T7 gene 10 translational start site upstream of the methyltransferase gene start
codon. DNA sequencing identifies a clone with the correct nucleotide sequence, termed
pGEM-hGSTM2-2. The pGEM-hGSTM2-2 plasmid described above is amplified by PCR
using primers LMB2 and LMB3 as above to create an 826 base pair PCR fragment
(GSTM2-2.LMB2-3) which carries the T7 RNA polymerase promoter, the phage T7 gene
10 translational start site and the GST gene. The PCR fragment is purified directly using
Wizard PCR Preps (Promega).

60 µl aliquots (1.2 x 10^ beads) of 1.0 µm diameter nonfluorescent neutravidin labelled
microspheres (Molecular Probes, F-8777) were spun in a microfuge at 10,000 g for 3 min.
The supernatant was removed and the beads resuspended, on ice, in 60 µl of a prokaryotic
in vitro coupled transcription/translation system designed for linear templates (Lesley et
al., 1991). A commercial preparation of this system is used (E. coli S30 Extract System
for Linear Templates; Promega) supplemented with 12.5 mM acetic acid (to lower the pH
to ~7.0), T7 RNA polymerase (2,000 units), 12.5 µg/ml λ DNA-HindIII digest (New
England Biolabs), 50 µM caged-biotin-βala-GSH, and, optionally, 5 nM GSTM2-2.LMB2-3
DNA or 5.0 µg of purified recombinant human GST M2-2 per 50 µl (or
neither).

A 5 µl aliquot was removed from each reaction mixture and left non-emulsified. 50 µl of
the remaining reaction mixture was emulsified essentially as Tawfik & Griffiths (1998).

The oil phase was freshly prepared by dissolving 4.5% (v/v) Span 80 (Fluka) in mineral
oil (Sigma, #M-5904) followed by 0.5% (v/v) Tween 80 (SigmaUltra; #P-8074). Ice-cooled
reaction mixtures were added gradually (in 5 aliquots of 10 µl over ~2 minutes) to
1.0 ml of ice-cooled oil-phase in a 5 ml Biofreeze Vial (Costar, #2051) whilst stirring
with a magnetic bar (8x3 mm with a pivot ring; Scientific Industries International,
Loughborough, UK). Stirring (at 1150 rpm) was continued for an additional 1 minute on
ice.

Both emulsions and non-emulsified reactions were incubated for 45 min at IS^'C to allow
translation to proceed. Then 5 µl 100 mM 1-chloro-2,4-dinitrobenzene (CDNB) (in
absolute ethanol) was added to each 1.0 ml emulsion and the emulsion vortexed for 5
seconds (the CDNB is transferred through the mineral oil to the aqueous compartments).
1.0 µl 2.5 mM CDNB (in water) was added to the non-emulsified reactions. CDNB
inhibits in vitro translation and adding it in this way, after translation is completed,
maximises the yield of GST.
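
The CDNB additions described above are simple dilutions, and the nominal concentrations
can be estimated with C1·V1 = C2·V2. The Python sketch below is a rough illustration
only: it assumes that volumes are additive and, for the emulsion, that the CDNB is
averaged over the whole 1.0 ml emulsion. The real concentration inside the aqueous
droplets depends on partitioning between the oil and aqueous phases, which is not
modelled here. The function name is hypothetical.

```python
def final_concentration_mM(stock_mM, added_ul, initial_ul):
    """Nominal concentration after adding a stock solution (C1*V1 = C2*V2).

    Assumes additive volumes and uniform mixing (assumptions).
    """
    return stock_mM * added_ul / (initial_ul + added_ul)


# 5 ul of 100 mM CDNB added to a 1.0 ml emulsion (averaged over both phases)
cdnb_emulsion_mM = final_concentration_mM(100.0, 5.0, 1000.0)

# 1.0 ul of 2.5 mM CDNB added to a 5 ul non-emulsified reaction
cdnb_bulk_mM = final_concentration_mM(2.5, 1.0, 5.0)

print(f"Emulsion (whole-volume average): {cdnb_emulsion_mM:.3f} mM")
print(f"Non-emulsified reaction:         {cdnb_bulk_mM:.3f} mM")
```

On these assumptions the two nominal concentrations are of the same order, both a little
under 0.5 mM.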

All reactions were incubated for 30 min at 25°C. The pH of the aqueous droplets was
then lowered to quench the reaction by vortexing the emulsions with 500 µl Sigma
Mineral Oil for Molecular Biology (M-5904) containing 4.5% Span 80 (Fluka), 0.5%
Tween 80 (Sigma Ultra) and 25 mM acetic acid. The non-emulsified reactions were
quenched by adding 5 µl 0.5 M acetic acid and 20 µl 0.1 M Na acetate, pH 5.0.

All reactions were transferred to a 24-well flat bottom plate (Corning, #25820) floating on
iced water and irradiated for 2 min with a B 100 AP UV lamp (UVP) held at a distance of
~6 cm. All samples were then incubated for 30 min at ambient temperature.

The emulsions were transferred to 1.5 ml microfuge tubes, spun for 1 min at 13.5k rpm in a
microfuge and the oil phase removed, leaving the concentrated (but still intact) emulsion at
the bottom of the tube. 200 µl 0.1 M Na acetate, pH 5.0 was added and the emulsion
broken by extracting 4 times with 1 ml hexane, vortexing between each hexane addition.
Residual hexane was removed by spinning for 10 min at ambient temperature under
vacuum in a Speedvac (Farmingdale, NY).

Approximately 5 x 10^ beads from the broken emulsions and the non-emulsified reactions
were then washed three times with 200 µl PBS, 0.1% Tween 20 in a 0.45 µm
MultiScreen-HV filter plate (Millipore, MAHVN4510), thoroughly resuspending between
each wash. Beads were then resuspended in 200 µl PBS, 0.1% Tween 20 containing 10 ng/µl
Alexa-488 labelled anti-DNP antibody (see Example 5) and incubated for 1 hour at
ambient temperature. The beads were washed three times with 200 µl PBS, 0.1% Tween
20 as above, then resuspended in 1 ml PBS, 0.1% Tween 20 and 10,000 events analysed
using a FACScan flow cytometer (Becton Dickinson).

As can be seen from Fig. 11, in both emulsified and non-emulsified reactions, the reaction
catalysed by in vitro translated GST M2-2 results in beads with higher fluorescence
than when no enzyme was present. This difference in fluorescence would, however, not
be sufficient for efficient fluorescence activated sorting (FACS). However, beads from
both emulsified and non-emulsified reactions containing 5.0 µg of purified recombinant
GST M2-2 per 50 µl were even more fluorescent than those containing in vitro translated
GST M2-2, enabling efficient enrichment of these beads by FACS from those incubated in
the absence of GST. This simulates the situation where a mutant GST of higher activity
than wild-type is translated in vitro.

Example 8. 

Genes attached to microbeads are expressed in vitro and the resulting gene-product
(an enzyme) binds to the microbeads whilst retaining catalytic activity.

One format for the selection of genetic elements is where the genetic element comprises a
gene linked to a microbead, which is translated in a microcapsule, and the translated
gene-product is coupled back onto the microbead within the microcapsule. Thus,
compartmentalisation leads to the formation of complexes of gene-products (e.g., proteins
or enzymes) attached to the gene encoding them. These complexes could be subsequently
selected for binding a ligand (see Example 12), or for enzymatic activity via a second
compartmentalised reaction.

Here it is shown that an enzyme (phosphotriesterase or PTE) can be transcribed and
translated in vitro from genes attached to microbeads and that the translated enzyme is
bound back to the microbeads. We also show that the translated enzyme can be modified,
assembled or complemented with a cofactor whilst it is bound on the beads - in this
example, metal ions are added to the apo-enzyme to give an active metalloenzyme.
Moreover, we show here that the catalytic activity of the enzyme is retained whilst it is
bound to the microbead together with the gene that encodes it.

The opd gene encoding a phosphotriesterase (PTE; also known as paraoxon hydrolase;
Mulbry & Karns, 1989) is amplified from Flavobacterium sp. strain ATCC 27551 by
PCR using a forward primer that appends stop codons and an EcoRI site (OPD-Fo; see
Table 1), and a back primer that appends the phage T7 gene 10 translational start site (RBS)
and a HindIII cloning site (OPD-Bc). This DNA is cloned into pGEM-4Z using the HindIII
and the EcoRI sites downstream of the T7 RNA polymerase promoter. DNA sequencing
identifies a clone which has the correct nucleotide sequence. Bacteria (E. coli, TG1)
transformed with this clone (Gem-OPD) are found to overexpress active PTE when grown
in the presence of cobalt chloride and induced with IPTG (Omburo et al., 1992).

The OPD gene is also cloned with a Flag™ peptide (Met-Asp-Tyr-Lys-Asp-Asp-Asp-Asp-Lys;
Sigma-Aldrich) appended to its N-terminus. The OPD gene is amplified from
Flavobacterium sp. strain ATCC 27551 by PCR using a forward primer (N-Flag-OPD-Fo)
that appends stop codons and a KpnI site, and a back primer (N-Flag-OPD-Bc) appending
an NcoI site, a Flag peptide and a short linker between the Flag peptide and the OPD
reading frame. The resulting DNA fragment is cloned into plasmid pGEM-4Z^NcoI (using
the KpnI and NcoI sites). pGEM-4Z^NcoI is a modification of pGEM-4Z into which the
phage T7 gene 10 translational start site (RBS) and an ATG start codon are appended
downstream of the T7 RNA polymerase promoter, to create an NcoI site that allows
cloning of reading frames in the context of the RBS and ATG codon. The sequence of the
section incorporated into pGEM-4Z (between the HindIII and the KpnI sites downstream
of the T7 RNA polymerase promoter), to give pGEM-4Z^NcoI, is indicated in Scheme I.

The rest of pGEM-4Z, including the KpnI and EcoRI cloning sites, remained intact.

5'-AAGCTTA ATAATTTTGTTTAACTTTAAGAAGGAGATATAGC...   ...GGTACC-3'

pGEM-4Z HindIII site; appended RBS, ATG and NcoI cloning site; KpnI site of pGEM-4Z

Scheme I

DNA sequencing identifies a clone that has the correct nucleotide sequence. Bacteria 
transformed with this clone (Gem-N-Flag-OPD) are found to over-express an active PTE 
when grown in the presence of Cobalt Chloride and induced with IPTG. 

The Gem-OPD and Gem-N-Flag-OPD plasmids described above are amplified by PCR,
using primers LMB2-biotin and LMB3, to create DNA fragments (OPD.LMB3-2biotin
and N-Flag-OPD.LMB3-2biotin, respectively) that carry the T7 RNA polymerase
promoter, the phage T7 gene 10 translational start site and the OPD or the N-Flag-OPD
genes and are labelled with biotin at the 3' end. The PCR fragments are purified directly
using Wizard PCR Preps (Promega).

Aliquots of a suspension of 0.95 µm non-fluorescent streptavidin labelled microspheres
(Bangs, ~2 x 10' beads per µl suspension) are spun in a microfuge at 10,000 g (13,500
rpm) for 3 min. The supernatant is removed and the beads resuspended in TNT buffer
(0.1 M Tris 7.5, 0.15 M NaCl, 0.05% Tween-20). An antibody that is capable of binding
amino-terminal Flag peptides and is labelled by biotinylation (BioM5, a biotin-labelled
anti-Flag antibody; Sigma) is added to the bead suspensions to an average of 4 x 10'*
antibody molecules per bead. The resulting mixture is incubated for several hours with
occasional mixing. The beads are rinsed twice by spinning down and resuspending them
in TNT buffer. Biotinylated DNA fragments (fragments OPD.LMB3-2biotin,
N-Flag-OPD.LMB3-2biotin, or fragments that carry the T7 RNA polymerase promoter, the
phage T7 gene 10 translational start site and a gene encoding a different enzyme that is
also tagged with N-Flag peptide, e.g., methyltransferase HaeIII -
N-Flag-M.HaeIII.LMB3-2biotin) are added to a suspension of antibody-coated beads and
the mixture is incubated overnight at 4°C. The beads are rinsed 3 times by spinning down
and resuspending them in TNT buffer.

50 µl aliquots of the above suspension of beads (~10' beads) are spun in a microfuge at
10,000 g for 3 min. The supernatant is removed and the beads gently resuspended, on ice,
in 50 µl of a prokaryotic in vitro coupled transcription/translation system designed for
linear templates (Lesley et al., 1991). A commercial preparation of this system is used (E.
coli S30 Extract System for Linear Templates; Promega) supplemented with T7 RNA
polymerase (2,000 units). The reactions are incubated at 25°C for 1.5 hours and spun in a
microfuge at 10,000 g for 3 min. The supernatant is removed and the beads resuspended
in 100 µl of 50 mM Tris, 10 mM potassium carbonate, pH 8.0. An aqueous solution
of cobalt chloride is added to a concentration of 1 mM and the reactions incubated for
several hours at room temperature (or overnight at 4°C). The beads are rinsed 4 times by
spinning down and resuspending them in TNT buffer.

Aliquots of the above beads are added to a solution of 0.25 mM Paraoxon in 50 mM Tris
pH 8.3. The beads are incubated at 25°C with occasional stirring for different periods of
time. The beads are spun in a microfuge at 10,000 g for 3 min, the supernatant is removed
and its optical density measured at 405nm. A significant change in optical density,
relative to the optical density observed under the same conditions in the absence of beads
or phosphotriesterase, is not observed when beads to which biotinylated DNA fragments
OPD.LMB3-2biotin or N-Flag-M.HaeIII.LMB3-2biotin are attached (and are
subsequently reacted as described above) are incubated with Paraoxon. However, a
significant change in optical density at 405nm is observed when beads to which
biotinylated DNA fragments N-Flag-OPD.LMB3-2biotin are attached (and are
subsequently reacted as described above) are incubated with Paraoxon. For example,
when biotinylated DNA fragments N-Flag-OPD.LMB3-2biotin are added at a
concentration of 1 nM (to a 50 µl suspension of beads (~10^ beads) that is then
resuspended in 50 µl in vitro transcription/translation), and reacted as described above,
the change in optical density observed after 3 hours corresponds to more than 50%
hydrolysis of Paraoxon (at 0.25 mM in a 50 µl reaction volume). Thus, microbeads
carrying a gene encoding a protein with the desired catalytic activity (phosphotriesterase
in the above example) can be clearly distinguished from microbeads carrying genes that
do not encode a protein with the desired catalytic activity (methyltransferase HaeIII in the
above example). Moreover, almost no change in optical density at 405nm is observed
when biotinylated DNA fragments N-Flag-OPD.LMB3-2biotin are attached to beads and
reacted as described above, except that cobalt chloride is not added to the resuspended
beads after transcription/translation.
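
The extent of Paraoxon hydrolysis can be estimated from the change in optical density at
405nm using the Beer-Lambert law, since hydrolysis releases p-nitrophenol. The Python
sketch below is illustrative only. The molar absorption coefficient used in the example
(17,000 per M per cm) and the 1 cm path length are assumptions, not values given in the
text; the coefficient depends on pH and buffer and should be measured under the actual
assay conditions. The function name is hypothetical.

```python
def fraction_hydrolysed(delta_od405, substrate_mM, epsilon_per_M_cm, path_cm=1.0):
    """Fraction of substrate hydrolysed, from the change in OD at 405 nm.

    Beer-Lambert: OD = epsilon * concentration * path length.
    """
    product_M = delta_od405 / (epsilon_per_M_cm * path_cm)
    return product_M / (substrate_mM * 1e-3)


# Assumed (not from the text): coefficient of released p-nitrophenol at 405 nm
ASSUMED_EPSILON = 17000.0  # per M per cm

# Hypothetical reading: OD change of 1.0 with 0.25 mM Paraoxon as substrate
example = fraction_hydrolysed(1.0, 0.25, ASSUMED_EPSILON)
print(f"Estimated fraction hydrolysed: {example:.1%}")
```

With the assumed coefficient, 50% hydrolysis of 0.25 mM Paraoxon would correspond to an
optical density change of roughly 2 in a 1 cm path, so in practice samples may need to be
diluted or read in a shorter path length.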

These results show that an enzyme (phosphotriesterase) can be transcribed and translated
in vitro from genes that encode this enzyme and are attached to microbeads. When the
genes encode a tag - an N-terminal Flag peptide in the above example - the translated
enzyme binds back to the microbeads to which the genes are attached. If necessary, the
translated enzyme can then be modified whilst it remains attached to the microbeads
(together with the gene that encodes it) - in this example, cobalt ions are added to give a
reactive metallo-enzyme. These results also indicate that the enzyme is catalytically active
whilst it is bound to microbeads together with the gene that encodes it.

Example 9. 

An enzyme catalyses a reaction with a caged-biotinylated substrate, and the
caged-biotinylated product generated is uncaged by UV irradiation and captured on
streptavidin-coated microbeads. Subsequently these beads are detected by flow
cytometry.

One format for the selection of genetic elements is where the genetic element comprises a
gene linked to a microbead, which is translated in a microcapsule, and the translated
gene-product is coupled back onto the microbead within the microcapsule. Thus,
compartmentalisation leads to the formation of complexes of gene-products (e.g., proteins
or enzymes) attached to the gene encoding them. These complexes could be subsequently
selected for binding a ligand (see Example 12), or for enzymatic activity via a second
compartmentalised reaction.

However, for such complexes to be selected for catalytic activity, a soluble substrate
should be available for the immobilised enzyme, and, once the catalytic reaction has been
completed, the product of the enzymatic activity that is being selected for should become
attached to the gene encoding this enzyme. The resulting complexes could then be sorted
or selected by virtue of the product being linked to them, for example by using a
fluorescently-labelled antibody that recognises the product. In other compartments,
containing complexes of genes and gene-products that do not encode proteins with the
desired enzymatic activity, the unreacted substrate should become linked to the gene.
These complexes will not be labelled with the product and will therefore be discarded.
Here it is shown that an enzyme (phosphotriesterase or PTE) can react with a
caged-biotinylated substrate in the presence of streptavidin-coated beads. The
caged-biotinylated product generated can then be uncaged by UV irradiation and captured
on avidin-coated beads. Subsequently, these beads are detected by flow cytometry and are
clearly distinguished from beads incubated with a caged-biotinylated substrate in the
presence of other enzymes or proteins that do not exhibit phosphotriesterase activity.

A caged-biotinylated substrate for PTE (EtNP-Bz-Glu-cagedBiotin; Fig. 12) is
synthesised as follows:

Boc-5-aminopentanol: Di-tert-butyl dicarbonate (20.8 g; 0.095 mol) is added to a stirred
solution of 5-aminopentanol (10.37 g; 0.1 mol) in dichloromethane (DCM) (200 ml) on
ice. Following addition, the solution becomes turbid and a syrup separates. Triethylamine
is added (13.8 ml; 0.1 mol) drop-wise, and the resulting solution is stirred for 10 minutes
on ice and then overnight at room temperature. The solvents are removed under vacuum,
the resulting syrup is dissolved in ethyl acetate (500 ml), extracted 3 times with 1 M
Na2HPO4 (pH 4), once with saturated NaHCO3, and finally with brine, and then dried over
MgSO4. The solvents are removed under vacuum and the resulting syrup (after extensive
drying under vacuum in the presence of potassium hydroxide), comprised primarily of
Boc-5-aminopentanol, is used without further purification.

(11) Triethylamine (3 ml; 22 mmol) is added drop-wise to a stirred solution of
p-nitrophenyl phosphodichloridate (5.15 g; 20 mmol) and ethanol (1.15 ml, 20 mmol)
cooled on dry-ice in acetone, within 30 minutes. The solution is allowed to slowly warm
up to room temperature and is stirred for an additional 90 minutes. A solution of
Boc-5-aminopentanol (4.3 g; ca. 20 mmol) and triethylamine (3 ml; 22 mmol) in DCM (20 ml)
is then added drop-wise. The reaction is allowed to stir at room temperature for 10
minutes, 1H-tetrazole is added (0.35 g; 5 mmol) and the reaction stirred for another 2
hours. DCM is added (100 ml) and the solution extracted 3 times with 1 M Na2HPO4 (pH
4), saturated NaHCO3, and finally with brine, and then dried over MgSO4. The solvents
are removed under vacuum to give a syrup that is purified by column chromatography on
silica (solvent: 1% to 2% methanol in DCM) to give 3.52 g of 11 (a syrup).

4-N-Boc-aminomethylbenzoic acid N-hydroxy succinimide ester:
Dicyclohexylcarbodiimide (DCC; 5.15 g; 25 mmol) is added to a stirred suspension of
4-N-Boc-aminomethylbenzoic acid (Tiger, Monmouth NJ; 5.2 g; 25 mmol) and
N-hydroxy succinimide (2.88 g; 25 mmol) in DCM (200 ml) plus acetonitrile (20 ml). The
reaction is stirred overnight at 4°C and then 3 hours at room temperature. The
dicyclohexyl urea precipitate is removed by filtration, and the filtrate concentrated under
vacuum to give a syrup. The syrup is dissolved in chloroform and DCM and treated with
activated charcoal. Addition of ether gives a white crystalline solid. Recrystallisation from
DCM and petroleum ether gives 6.2 g of the N-hydroxy succinimide ester of
4-N-Boc-aminomethylbenzoic acid.
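
The reagent quantities in the synthesis steps above are quoted both as masses and as molar
amounts. The Python sketch below shows how the two can be related; it is an illustration
only. The molecular weights are approximate values calculated from the molecular formulae
and are assumptions introduced here, not values given in the text; the names used are
hypothetical.

```python
# Approximate molecular weights in g/mol (assumed, calculated from formulae)
MOLECULAR_WEIGHT = {
    "di-tert-butyl dicarbonate": 218.25,
    "5-aminopentanol": 103.16,
    "p-nitrophenyl phosphodichloridate": 255.98,
    "dicyclohexylcarbodiimide": 206.33,
    "N-hydroxy succinimide": 115.09,
}

# Masses in grams as quoted in the synthesis steps
QUOTED_MASS_G = {
    "di-tert-butyl dicarbonate": 20.8,
    "5-aminopentanol": 10.37,
    "p-nitrophenyl phosphodichloridate": 5.15,
    "dicyclohexylcarbodiimide": 5.15,
    "N-hydroxy succinimide": 2.88,
}


def millimoles(mass_g, molecular_weight):
    """Convert a mass in grams to millimoles."""
    return 1000.0 * mass_g / molecular_weight


amounts_mmol = {
    name: millimoles(mass, MOLECULAR_WEIGHT[name])
    for name, mass in QUOTED_MASS_G.items()
}

for name, amount in amounts_mmol.items():
    print(f"{name}: {amount:.1f} mmol")
```

The same conversion can be applied to any other reagent in the scheme once its molecular
weight is known.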

(12) Trifluoroacetic acid (TFA; 4 ml) is added to a solution of 11 (900 mg; 2.07 mmol)
in DCM (5 ml). The solution is left at room temperature for 45 minutes and the solvents
are removed under vacuum. The residual syrup is triturated by dissolving it in DCM and
methanol and adding ether. The resulting 12 (as TFA salt; syrup) is dried under vacuum in
the presence of potassium hydroxide, and then reacted immediately without further
purification (see below).

(13) 4-N-Boc-aminomethylbenzoic acid N-hydroxy succinimide ester (670 mg; 2.2 mmol)
and triethylamine (0.345 ml; 2.5 mmol) are added to 12 (see above) in DCM (15 ml). The
solution is stirred for 30 minutes, triethylamine (0.1 ml; 0.72 mmol) is added, and the
solution stirred for an additional 3 hours. DCM is added (20 ml), and the solution extracted
twice with 1 M Na2HPO4 (pH 4), once with saturated NaHCO3, and finally with brine, and
then dried over MgSO4. The solvents are removed under vacuum to give a syrup that is
purified by column chromatography on silica (solvent: 5% methanol in DCM) to give
0.86 g of 13 (a syrup).

(14) 0.84 g of 13 (1.6 mmol) is treated with TFA as described above to give 14 (as
TFA salt; syrup) which is reacted immediately as described below.

(15) Boc-Glu(OSu)-OBu^t (Bachem; 641 mg; 1.6 mmol) and triethylamine (0.235 ml; 1.7
mmol) are added to 14 (see above) in DCM (15 ml). The solution is stirred for 1 hour,
triethylamine (60 µl; 0.43 mmol) is added, and the solution stirred for 1 hour. DCM is
added (20 ml), and the solution extracted twice with 1 M Na2HPO4 (pH 4), once with
saturated NaHCO3, and finally with brine, and then dried over MgSO4. The solvents are
removed under vacuum to give a syrup that is purified by column chromatography on
silica (solvent: 7% methanol in DCM) to give 0.8 g of 15 (a white crystalline solid).

EtNP-Bz-Glu (16) 0.4 g of 15 (0.56 mmol) is dissolved in DCM (5 ml) and TFA (5 ml).
The solution is stirred for 1 hour at room temperature, and the solvents are removed under
vacuum. The residual syrup is crystallised by dissolving it in methanol and adding ether.
Recrystallisation (in methanol and ether) gives 200 mg of 16 (as TFA salt; white solid).

EtNP-Bz-Glu-cagedBiotin (17) Carbonyldiimidazole (6 mg, 37.5 µmol) is added to a
solution of MeNPO-CO-Biotin-OH (5, 17 mg, 35 µmol) in DMF (1 ml). The solution is
stirred for 60 minutes at room temperature and added to 16 (20 mg, 30 µmol).
Triethylamine (5.5 µl, 40 µmol), DMF (1 ml) and water (0.5 ml) are added to the stirred
reaction mixture until it becomes clear. The solution is stirred for 2 hours at room
temperature and stored at -20°C.

The product of the above reaction is purified by reverse-phase HPLC on a C8 preparative
column using a water-acetonitrile gradient in the presence of 0.1% trifluoroacetic acid.
The peak corresponding to 17 (retention time = 23.1 minutes) is collected. The product is
isolated by freeze-drying as a yellow solid. Analysis of the product after the HPLC
purification using analytical reverse-phase HPLC indicated a major product (>80%), the
UV spectrum of which corresponded to 17. Specifically, absorbance at 355nm indicates the
presence of the O-methylnitropiperonyl-carbonyl group of the caged-biotin (Pirrung &
Huang, 1996), and a 'shoulder' at 277nm, absent in caged-biotin, indicates the presence of
the p-nitrophenyl phosphate ester of 17. The concentration of 17 is verified by
hydrolysing the p-nitrophenyl phosphate ester in 0.1 M potassium hydroxide and
determining the amount of p-nitrophenol released (optical density at 405nm).

The purified 17 is also found to be a substrate for PTE, leading to the release of
p-nitrophenol (Fig. 13; monitored by the change in optical density at 405nm) with rates that
are only about 6 fold slower than those observed with Paraoxon. Notably, unlike the
base-catalysed hydrolysis of 17 which proceeds to completion (and the PTE-catalysed
hydrolysis of Paraoxon), the PTE-catalysed hydrolysis of 17 proceeds with significant
rates only until half of the substrate has been hydrolysed. The second half of the substrate
could also be hydrolysed, but only in the presence of much higher quantities of PTE and
after long incubations (several hours to overnight). This is probably due to the fact that
17 is comprised of two diastereomers (corresponding to two enantiomers with
regard to the chiral phosphotriester), only one of which is an effective substrate for the
enzyme. Indeed, stereoselectivity was previously observed with PTE and other chiral
phosphotriesters (Hong & Raushel, 1999).

Antibodies are generated that would recognise ethyl-phosphodiesters that are the products
of hydrolysis of the corresponding p-nitrophenyl phosphotriesters. To this end, a suitable
ethylphosphodiester derivative is synthesised and conjugated to carrier proteins as
described below (Fig. 14).

EtNPBG (18) Glutaric anhydride (180 mg; 1.6 mmol) and triethylamine (0.22 ml; 1.6
mmol) are added to 12 (prepared by de-protection of 1.6 mmol of 11, as described above)
in DCM (15 ml). The solution is stirred for 20 minutes, triethylamine (0.12 ml; 0.85
mmol) is added, and the solution stirred for an additional 1 hour. DCM is added (20 ml),
and the solution extracted twice with 1 M Na2HPO4 (pH 4) and then dried over MgSO4.
The solvents are removed under vacuum to give a syrup that is purified by column
chromatography on silica (solvent: 12.5% methanol in DCM plus 0.1% acetic acid) to
give 445 mg of 18 (a syrup).

Substrate conjugates EtNPBG-BSA and EtNPBG-KLH. Carbonyldiimidazole (CDI;
32 mg, 200 µmol) is added to a solution of 18 (60 mg, 134 µmol) in DMF (1 ml). The
solution is stirred for 60 minutes at room temperature. Aliquots of the activated 18 are
then added to 5 mg/ml solutions of bovine serum albumin (BSA) or keyhole limpet
hemocyanin (KLH) in 0.1 M phosphate pH 8.0 (at 0.5 to 4 µmole of 18 per mg protein).
The reactions are stirred for 1 hour at room temperature, and the resulting protein
conjugates are dialysed extensively against phosphate buffered saline (PBS) at 4°C. The
level of conjugation (hapten density or Hd) is determined by hydrolysing a sample of the
dialysed conjugates in 0.1 M potassium hydroxide and monitoring the amount of released
p-nitrophenol (at 405nm). These are found to be: 8.5 to 24 EtNPBG molecules per BSA
molecule and 14 to 63 per KLH molecule, depending on the amount of activated 18 added
to the protein samples.

Product conjugates EtBG-BSA and EtBG-KLH. The EtNPBG-BSA and EtNPBG-KLH
conjugates described above are dialysed against 0.1 M carbonate pH 11.8 for 44
hours at room temperature, and then extensively against PBS (at 4°C).

Anti-EtBG antibodies were elicited in rabbits by immunisation with EtBG-KLH
(Hd=14) using published protocols (Tawfik et al., 1993; Tawfik et al., 1997) (gift of Prof.
Z. Eshhar, Weizmann Institute of Science, Rehovot). Sera are tested by ELISA for binding
to both the substrate conjugate EtNPBG-BSA (Hd=8.5) and the corresponding product
conjugate (EtBG-BSA; Hd=8.5). The first bleed from one of the immunised rabbits (when
diluted 500 fold or more) exhibits the desirable selectivity, yielding a high signal when
incubated with the product conjugate and a low background (<20%) with the substrate
conjugate. Diluting the sera in COVAp buffer (2 M NaCl, 10 g/l MgSO4.7H2O, 0.04%
Tween-20, 10 mM phosphate, 0.1 mM p-nitrophenol, pH 6.5) further increases selectivity,
with background levels going below 5%. The anti-EtBG serum is purified using a HiTrap
Protein A column (Pharmacia). The purified rabbit antibodies are labelled with an Alexa
Fluor 488 protein labelling kit (Molecular Probes) according to the manufacturer's
instructions.

10 µl (~2 x 10' beads) of 0.95 µm streptavidin-coated microbeads (Bangs, ~2 x 10' beads
per µl suspension) are spun in a microfuge at 10,000 g for 3 min and the supernatant
removed. The beads are resuspended in 10 µl of 50 mM Tris pH 8.3 containing
EtNP-Bz-Glu-cagedBiotin (17) to give a final concentration of 10 µM, 20 µM or 30 µM. PTE
is expressed in vitro by transcription/translation of OPD.LMB3-2biotin DNA fragments (at
5 nM). A commercial preparation is used (E. coli S30 Extract System for Linear
Templates; Promega) supplemented with T7 RNA polymerase (2,000 units) and the
reactions are incubated at 25°C for 1.5 hours. The PTE is then assembled by the addition
of potassium carbonate (10 mM) and cobalt chloride (1 mM) in Tris buffer (10 mM pH
8.0) and incubating overnight at 4°C. Another enzyme that does not exhibit
phosphotriesterase activity, methyltransferase HaeIII, is also expressed in vitro by
transcription/translation from M.HaeIII.LMB3-2biotin DNA fragments (at 5 nM), and
then treated with carbonate and cobalt as with the PTE. 5 µl aliquots of the above reaction
mixtures are added to the bead suspensions and the reactions are incubated for 1 hour at
25°C in the dark. The reaction is stopped by the addition of 15 µl 0.1 M sodium acetate,
pH 5.0 and transferred to ice. Each reaction is then split into two aliquots of 15 µl each,
one of which is placed as a spot on a layer of parafilm on the surface of an ice-cooled
aluminium block. This aliquot is then irradiated for 2 min with a B 100 AP UV lamp
(UVP) held at a distance of ~6 cm. The other aliquot is left in the dark. All bead samples
are then incubated for 30 minutes at ambient temperature and washed three times with
200 µl PBS, 0.1% Tween 20 in a 0.45 µm MultiScreen-HV filter plate (Millipore,
MAHVN4510), thoroughly resuspending between each wash. Beads (~2 x 10^) are then
resuspended in 200 µl COVAp containing 100 ng/µl Alexa-488 labelled rabbit anti-EtBG
antibodies and incubated for 1 hour at room temperature and then 1 hour at 4°C. The
beads were washed three times with 200 µl PBS, 0.1% Tween 20 as above, then
resuspended in 1 ml PBS, 0.1% Tween 20 and 10,000 events analysed using a FACScan
flow cytometer (Becton Dickinson).

As can be seen in Fig. 15, up to a 20-fold increase in mean bead fluorescence is observed
following the PTE catalysed hydrolysis of EtNP-Bz-Glu-cagedBiotin in the presence of
streptavidin-coated beads and after UV irradiation. This increase is observed relative to
beads treated essentially the same but in the presence of another enzyme (M.HaeIII) with
no phosphotriesterase activity. Notably, the differences in fluorescence signal are
observed when both the PTE and the M.HaeIII are expressed in vitro from the
corresponding genes and are added together with the entire content of the in vitro
transcription/translation reaction mixture.

At high substrate concentrations the observed mean fluorescence is lower than that observed at
20 µM. In addition, at substrate concentrations above 20 µM, there is essentially no
difference in the fluorescence signal between reactions kept in the dark and those UV
irradiated (data not shown). Since the beads, under the reaction conditions described
above, start to exhibit saturation of binding signal at concentrations above 10 µM (of
product as detected by the subsequent addition of fluorescently-labelled anti-EtBG
antibodies), these results may be explained by the presence of contaminating
EtNP-Bz-Glu-Biotin in the preparation of EtNP-Bz-Glu-cagedBiotin. These results are also
consistent with the previously observed background immobilisation of caged-biotin to
avidin 'in the dark' (i.e., without UV illumination), which was as high as 15% of the
signal observed after illumination (Sundberg et al. 1995). The 'dark' signal observed
previously was ascribed to either trace contaminants of biotin in the caged-biotin
preparation, or to weak interactions between avidin and components of the caged-biotin
including the linker (Sundberg et al. 1995). Both mechanisms may account for the fact
that at high concentrations of caged-biotinylated substrate (above the binding capacity
of the beads), the 'dark' signal becomes significant. Nevertheless, at substrate
concentrations of 20 µM or lower, the 'dark' signal constitutes only 25%, or even less
than 10% (e.g., at 10 µM EtNP-Bz-Glu-cagedBiotin), of the illuminated signal. This
indicates that most of the PTE-catalysed hydrolysis of EtNP-Bz-Glu-cagedBiotin takes
place whilst the substrate is in solution and not attached to the beads, and that the
resulting product (Et-Bz-Glu-cagedBiotin), after illumination with UV light, is un-caged
and becomes immobilised onto the microbeads.
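
The comparison of 'dark' and UV-irradiated samples, and of PTE beads with control beads, comes down to ratios of mean bead fluorescence. The following is a minimal sketch in Python, assuming the mean fluorescence of each sample has already been exported from the flow cytometer. The function names and the numerical values are hypothetical placeholders chosen for illustration and are not data from this example.

```python
def dark_signal_fraction(mean_dark: float, mean_illuminated: float) -> float:
    """Fraction of the illuminated signal that is already present in the dark."""
    if mean_illuminated <= 0:
        raise ValueError("illuminated signal must be positive")
    return mean_dark / mean_illuminated


def fold_increase(mean_sample: float, mean_control: float) -> float:
    """Fold increase of a sample over a control (e.g. PTE beads over M.HaeIII beads)."""
    if mean_control <= 0:
        raise ValueError("control signal must be positive")
    return mean_sample / mean_control


# Hypothetical mean fluorescence values (arbitrary units), for illustration only.
print(dark_signal_fraction(25.0, 100.0))  # 'dark' signal as a fraction of the illuminated signal
print(fold_increase(200.0, 10.0))         # fold increase of a sample over its control
```

A 'dark' fraction close to 1 would correspond to the situation described above for substrate concentrations above 20 µM, where irradiation makes essentially no difference.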

Example 10. 

Genes attached to beads are expressed in vitro and the resulting gene-products
(enzymes) become immobilised to the microbeads whilst retaining catalytic activity.
The immobilised enzyme catalyses a reaction with a caged-biotinylated substrate,
and the resulting caged-biotinylated product is subsequently uncaged by UV
irradiation and becomes attached to these beads together with the gene encoding the
enzyme that led to its formation. Subsequently, these beads are detected by flow
cytometry.

One format for the selection of genetic elements is where the genetic element comprises a
gene linked to a microbead, which is translated in a microcapsule, and the translated
gene-product is coupled back onto the microbead within the microcapsule. Thus,
compartmentalisation leads to the formation of complexes of gene-products (e.g., proteins
or enzymes) attached to the gene encoding them. These complexes could be subsequently
selected for binding a ligand (see Example 12), or for enzymatic activity via a second
compartmentalised reaction.




For such complexes to be selected for catalytic activity, a soluble substrate should be
available for the immobilised enzyme, and, once the catalytic reaction has been
completed, the product of the enzymatic activity that is being selected for should become
attached to the gene encoding this enzyme. The resulting complexes could then be sorted
or selected by virtue of the product being linked to them, for example by using a
fluorescently-labelled antibody that recognises the product. In other compartments,
containing complexes of genes and gene-products that do not exhibit the desired
enzymatic activity, the unreacted substrate would become linked to the gene. These
complexes will not be labelled with the product and will therefore be discarded.

Here it is shown that an enzyme (phosphotriesterase or PTE) can be transcribed and
translated in vitro from genes attached to microbeads and that the translated enzyme is bound
back to the microbeads. The translated enzyme can then be modified to incorporate the
active-site cobalt, and its catalytic activity is retained whilst it is bound to the microbead
together with the gene that encodes it. The immobilised PTE subsequently reacts with a
caged-biotinylated substrate, and the caged-biotinylated product generated is uncaged by UV
irradiation and captured onto the same avidin-coated beads to which the gene encoding
the PTE is attached. Subsequently these beads are detected by flow cytometry and are
clearly distinguished from beads carrying a gene encoding a protein that does not exhibit
phosphotriesterase activity.

Aliquots of a suspension of 0.95 µm streptavidin-coated microspheres (Bangs, ~2 x 10^
beads per µl suspension) are spun in a microfuge at 10,000 g for 3 min. The supernatant is
removed and the beads resuspended in TNT buffer (0.1 M Tris pH 7.5, 0.15 M NaCl, 0.05%
Tween-20). A biotinylated antibody capable of binding the Flag peptide (BioM5, a
biotin-labelled anti-Flag antibody; Sigma) is added to the bead suspensions to give an
average of 10^ antibody molecules per bead and the mixture is incubated for several
hours. The beads are rinsed by spinning down and resuspending them in TNT buffer to
the original volume. Biotinylated DNA fragments N-Flag-OPD.LMB3-2biotin (or
fragments that carry the T7 RNA polymerase promoter, the phage T7 gene 10 translational
start site and a gene encoding a different enzyme, also tagged with the N-Flag peptide, e.g.,
methyltransferase HaeIII: N-Flag-M.HaeIII.LMB3-2biotin) are added to the suspension
of antibody-coated beads at 1.6 nM concentration and the mixture is incubated overnight
at 4°C. The beads are rinsed 3 times by spinning down and resuspending them in TNT
buffer.



50 µl aliquots of the above suspension of beads (~10^ beads) are spun in a microfuge at
10,000 g for 3 min. The supernatant is removed and the beads gently resuspended, on ice,
in 50 µl of a prokaryotic in vitro coupled transcription/translation system designed for
linear templates (Lesley et al., 1991). A commercial preparation of this system is used (E.
coli S30 Extract System for Linear Templates; Promega) supplemented with T7 RNA
polymerase (2,000 units). The reactions are incubated at 25°C for 1.5 hours and spun in a
microfuge at 10,000 g for 3 min. The supernatant is removed and the beads resuspended
in 100 µl of 50 mM Tris, 10 mM potassium carbonate, pH 8.0. An aqueous solution of
cobalt chloride is added to a concentration of 1 mM and the reactions incubated for 2
hours at room temperature. The beads are rinsed 4 times by spinning down and
resuspending them in TNT buffer. Finally, beads are resuspended in TNT buffer to the
original volume.

Aliquots of the above beads are added to solutions of 0.25 mM Paraoxon in 50 mM Tris
pH 8.3. The beads are incubated at 25°C with occasional stirring for different periods of
time. The beads are spun in a microfuge at 10,000 g for 3 min, the supernatant is removed
and its optical density measured at 405 nm. A significant change in optical density at
405 nm is observed with beads to which biotinylated DNA fragments
N-Flag-OPD.LMB3-2biotin are attached (and which are subsequently reacted as described above), in
contrast to reactions conducted under the same conditions but in the absence of beads or
phosphotriesterase, or with beads to which N-Flag-M.HaeIII.LMB3-2biotin DNA
fragments are attached and which are subsequently reacted as described above.

Next, 10 µl (~2 x 10^ beads) of the above beads are spun in a microfuge at 10,000 g for 3
min and the supernatant removed. The beads are resuspended in 10 µl of 12.5 or 25 µM
EtNP-Bz-Glu-cagedBiotin in 50 mM Tris pH 8.3. The bead suspensions are incubated for
1.5 hours at 25°C in the dark. The reaction is stopped by the addition of 10 µl 0.1 M
sodium acetate, pH 5.0, transferred to ice and irradiated for 2 min with a B 100 AP
UV lamp (UVP) held at a distance of ~6 cm. All bead samples are then incubated for 30
minutes at ambient temperature and then washed three times with 200 µl PBS, 0.1%
Tween 20 in a 0.45 µm MultiScreen-HV filter plate (Millipore, MAHVN4510),
thoroughly resuspending between each wash. Beads (~7 x 10^) are then resuspended in 125
µl of a rabbit anti-EtBG serum diluted 1:125 in COVAp and incubated overnight at
4°C. The beads are washed once with 200 µl COVAp and then 3 times with 200 µl PBS,
0.1% Tween 20 as above and are resuspended in 200 µl PBS, 0.1% Tween 20. 70 µl of
the above bead suspensions (~2 x 10^) are added to 50 µl of 40 ng/µl FITC-labelled goat
anti-rabbit Fab (Jackson 115-095-006) in PBS, 0.1% Tween 20 and incubated for 1 hour at
room temperature. The beads are washed 3 times with 200 µl PBS, 0.1% Tween 20 as
above, then resuspended in 1 ml PBS, 0.1% Tween 20 and 10,000 events analysed using
a FACScan flow cytometer (Becton Dickinson).

Consequently, as seen in Fig. 16, beads to which genes encoding the phosphotriesterase
tagged with the Flag peptide were attached (along with an antibody that binds the Flag
peptide) could be clearly distinguished from beads to which other genes, encoding
enzymes with no phosphotriesterase activity (e.g., N-Flag-M.HaeIII), were attached.

Example 11. 

E. coli BirA transcribed and translated in vitro catalyses a reaction which gives rise
to a change in the fluorescence properties of substrate-labelled microspheres in the
aqueous compartments of a water-in-oil emulsion.

The gene encoding a peptide from Propionibacterium shermanii which is biotinylated in
vivo in E. coli is amplified using oligonucleotides BCCP5 and BCCP3 from the vector
Pinpoint Xa-1 (Promega). The PCR fragment is cloned into the vector pET-23d(FLAG)
digested with BamHI and HindIII, downstream of a T7 RNA polymerase promoter and the
phage T7 gene 10 translational start site, and in frame with an N-terminal FLAG
peptide-coding region; this vector is termed pET-23d(FLAG-BCCP). The vector
pET-23d(FLAG) is identical to the vector pET-23d (Novagen) except for the region between
the unique NcoI and BamHI sites, which has been modified to include an N-terminal
FLAG peptide-coding region as shown below in Scheme 2. In order to append a
hexahistidine tag to the C-terminus of the protein, the two oligonucleotides BCCPHis+
and BCCPHis- were annealed and then ligated into the vector pET-23d(FLAG-BCCP)
digested with SacI and NotI, yielding the vector pET-(FLAG-BCCP-His). The protein
FLAG-BCCP-His (termed FBH) is overexpressed in strain C41(DE3) (Miroux & Walker,
1996), harvested and purified with Ni-NTA agarose (Qiagen) under native conditions,
following the manufacturer's protocol. Biotinylated protein is depleted by incubation with
an equal volume of avidin-agarose (Sigma), pre-equilibrated with a wash buffer (50 mM
NaH2PO4, pH 8.0; 300 mM NaCl; 20 mM imidazole), for 1 hour at 4°C. The suspension
is then centrifuged at 10,000 g for 2 minutes and the supernatant retained, aliquoted and
stored in liquid nitrogen (long-term) or at 4°C.

      M  D  Y  K  D  D  D  D  K  M  H  G  N  E  G
TATACCATGGACTACAAAGATGACGATGATAAAATGCATGGCAACGAAGGT
    NcoI site of pET-23d (appended FLAG coding region)

T
ACC GGATCC AAGCTT
    BamHI  HindIII
    (BamHI site of pET-23d; HindIII site)

Scheme 2

The gene encoding E. coli BirA was amplified by PCR using oligonucleotides BirA5 and
BirA3 from a pBluescript 2SK+ vector containing the E. coli BirA gene (gift from P.
Wang, unpublished). The PCR fragment is cloned into the vector pGEM-4Z(K2) digested
with KpnI and XhoI downstream of the lac promoter, T7 RNA polymerase promoter and
the efficient phage T7 gene 10 translational start site. The vector pGEM-4Z(K2) is
identical to the vector pGEM-4Z^^^ (see Example 8, Scheme 1), except for the region
between the unique NcoI and KpnI sites, which has been modified according to Scheme 3
shown below to contain a unique XhoI site downstream of the NcoI site.

  M  G  G  S  S
CCATGGGGGGCTCGAGC GGTACC
NcoI      XhoI    KpnI (site of the parent vector)

Scheme 3

DNA sequencing identifies a clone with the correct nucleotide sequence, termed
pGEM-BirA. The pGEM-BirA plasmid described above is amplified by PCR using primers
LMB2 and LMB3 as above to create a 1139 base pair PCR fragment (BirA_LMB2-3)
which carries the T7 RNA polymerase promoter, the phage T7 gene 10 translational start
site and the BirA gene. The PCR fragment is purified directly using Wizard PCR Preps
(Promega).

60 µL aliquots (1.2 x 10^ beads) of 1.0 µm diameter nonfluorescent goat anti-mouse IgG
labelled microspheres (Bangs Laboratories, CP03N) were spun in a microfuge at
approximately 2,600 g (6,000 rpm) for 3 minutes. The supernatant was removed and the
beads resuspended in 60 µL 0.1 M Tris-HCl, pH 7.5, 0.15 M NaCl, 0.05% Tween-20,
0.5% BSA. The beads were spun again, resuspended in 60 µL M5 anti-FLAG antibody
(Sigma F4042) and incubated overnight at 4°C. The beads were spun again (2,600 g) for
3 minutes, the supernatant was removed, and the beads were resuspended in a mixture of
30 µL 0.1 M Tris-HCl, pH 7.5, 0.15 M NaCl, 0.05% Tween-20, 0.5% BSA and 30 µL of
FBH protein obtained as above (final protein concentration approx. 4 mg/ml) and
incubated for 1 hour at room temperature.

Meanwhile, 60 µL aliquots of a prokaryotic in vitro coupled transcription/translation
system designed for linear templates (Lesley et al., 1991) were prepared, using a
commercial kit (E. coli S30 Extract System for Linear Templates; Promega),
supplemented with T7 RNA polymerase (2,000 units) and 10 nM BirA_LMB2-3 DNA (or no
DNA at all). These aliquots were incubated at 25°C for 1 hour to allow translation.

The 60 µL aliquots of beads were spun at 2,600 g (6,000 rpm) in a microfuge for 3
minutes and the supernatant removed. They were resuspended in 60 µL of 0.1 M
Tris-HCl, pH 7.5, 0.15 M NaCl, 0.05% Tween-20, 0.5% BSA, respun and the supernatant
removed. Finally they were resuspended on ice in a 54 µL aliquot of the prokaryotic in
vitro coupled transcription/translation reactions described above, supplemented with 3 µL
of 2 mM d-biotin and 3 µL of 0.2 M ATP.

A 5 µl aliquot was removed from each reaction mixture and left non-emulsified. 50 µl of
the remaining reaction mixture was emulsified essentially as in Tawfik & Griffiths (1998).
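
The final d-biotin and ATP concentrations in the biotinylation reaction follow from the supplement volumes given above (3 µL of 2 mM d-biotin and 3 µL of 0.2 M ATP added to a 54 µL reaction). A minimal sketch of the dilution arithmetic in Python, assuming the volumes are simply additive; the function name is illustrative and not part of the protocol:

```python
def final_concentration(stock_conc: float, stock_volume: float, total_volume: float) -> float:
    """Dilution rule C1 * V1 = C2 * V2, solved for the final concentration C2."""
    if total_volume <= 0:
        raise ValueError("total volume must be positive")
    return stock_conc * stock_volume / total_volume


# Volumes in microlitres, concentrations in mM (0.2 M ATP = 200 mM).
total_ul = 54 + 3 + 3  # reaction + d-biotin + ATP, assumed additive
biotin_mM = final_concentration(2.0, 3, total_ul)
atp_mM = final_concentration(200.0, 3, total_ul)
print(f"d-biotin: {biotin_mM:.2f} mM, ATP: {atp_mM:.1f} mM")
```

Under this assumption the supplements correspond to roughly 0.1 mM d-biotin and 10 mM ATP in the 60 µL reaction.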

The oil phase was freshly prepared by dissolving 4.5% (v/v) Span 80 (Fluka) in mineral
oil (Sigma, #M-5904) followed by 0.5% (v/v) Tween 80 (SigmaUltra; #P-8074).
Ice-cooled reaction mixtures were added gradually (in 5 aliquots of 10 µl over ~2 minutes) to
1.0 ml of ice-cooled oil phase in a 5 ml Biofreeze Vial (Costar, #2051) whilst stirring
with a magnetic bar (8x3 mm with a pivot ring; Scientific Industries International,
Loughborough, UK). Stirring (at 1150 rpm) was continued for an additional 1 minute on
ice.

All reactions were incubated for 4 hours at 37°C to allow the biotinylation reaction to
proceed.

The emulsions were transferred to 1.5 ml microfuge tubes, spun for 1 min at 13.5k rpm in a
microfuge and the oil phase removed, leaving the concentrated (but still intact) emulsion at
the bottom of the tube. 200 µl 0.1 M Tris-HCl, pH 7.5, 0.15 M NaCl, 0.05% Tween-20,
0.5% BSA were added and the emulsion broken by extracting 4 times with 1 ml hexane,
vortexing between each hexane addition. Residual hexane was removed by spinning for
10 min at ambient temperature under vacuum in a Speedvac (Farmingdale, NY).

Approximately 1 x 10^ beads from the broken emulsions and the non-emulsified reactions
were then washed twice with 100 µl TNT/BSA in a 0.45 µm MultiScreen-HV filter plate
(Millipore, MAHVN4510), thoroughly resuspending between each wash. Beads were then
resuspended in 50 µl 0.1 M Tris-HCl, pH 7.5, 0.15 M NaCl, 0.05% Tween-20, 0.5% BSA
containing 1 µL of a streptavidin-HRP solution (provided with the NEN TSA™-Direct
kit) and incubated for 30 minutes at ambient temperature. The beads were washed twice
with 100 µl 0.2 M Tris, 10 mM imidazole, pH 8.8, as above, then resuspended in 50 µL
0.2 M Tris, 10 mM imidazole, pH 8.8, 0.01% H2O2. 1 µL of a fluorescein tyramide stock
solution (made up according to the manufacturer's instructions (NEN TSA™-Direct kit))
was added, and the reaction left to proceed for ten minutes. The beads were washed twice
with PBS, as above, and finally resuspended in a total of 500 µL PBS, transferred to a 5
ml polystyrene round-bottomed tube (Falcon) and 10,000 events analysed using a
FACScan flow cytometer (Becton Dickinson).

As can be seen from Fig. 17, both in emulsified and non-emulsified reactions, the reaction
catalysed by in vitro translated BirA results in beads with higher fluorescence than when
no enzyme was present. It appears that beads which have been incubated in an emulsion
with in vitro translated BirA are more fluorescent than beads which have not been
incubated in emulsions.

Example 12.

A change in fluorescence of genetic elements can be used to selectively enrich genetic
elements encoding peptides with a binding activity. The fluorescently labelled
genetic elements are isolated by flow cytometric sorting.

One format for the selection of genetic elements is where the genetic element comprises a
gene linked to a microbead, which is translated in a microcapsule, and the translated
gene-product is coupled back onto the microbead within the microcapsule. Thus,
compartmentalisation leads to the formation of complexes of gene-products attached to
the gene encoding them. These complexes can subsequently be selected for binding to a
ligand by flow cytometric sorting if the binding interaction results in a change in
microbead fluorescence.

The pET-23d(FLAG) vector encodes an N-terminal FLAG peptide fused to the polylinker region
of pET-23d (Novagen). pET-23d was digested with NcoI/BamHI, gel purified and
redissolved in water. Two synthetic phosphorylated oligonucleotides (Vh Bio Ltd,
Newcastle upon Tyne, U.K.), FLAG and FLAGas, were mixed at 1 µM concentration
each in water, heated for 3 min at 94°C and allowed to cool to room temperature before
being added to the digested vector in the ligation mix. The ligation reaction was used
unpurified to transform E. coli TG-1. Clones containing the insert were identified by KpnI
digest and verified by sequencing (Oswel Research Product Ltd, Southampton, U.K.). The
polylinker region of pET-23d(FLAG) is as follows:



NcoI                                        KpnI
        10        20        30        40        50
CCATGGACTACAAAGATGACGATGATAAAATGCATGGCAACGAAGGTACC
GGTACCTGATGTTTCTACTGCTACTATTTTACGTACCGTTGCTTCCATGG
  M  D  Y  K  D  D  D  D  K
  <    FLAG-peptide tag     >

BamHI EcoRI SacI   SalI  HindIII NotI   XhoI
        60        70        80        90
GGATCCGAATTCGAGCTCCGTCGACAAGCTTGCGGCCGCACTCGAGCA
CCTAGGCTTAAGCTCGAGGCAGCTGTTCGAACGCCGGCGTGAGCTCGT



A biotinylated FLAG-HA expression construct was prepared from the pET-23d(FLAG)
vector by PCR. The peptide sequence YPYDVPDYA from the influenza haemagglutinin
was appended to the FLAG-tag in pET-23d(FLAG) using the primer FLAGHA and the
5'-biotinylated primer pETrev.b. The amplification product is 903 bases long and the
coding region of the construct is:

        10        20        30        40        50
ATGGACTACAAAGATGACGATGATAAAATGCATGGCAACGAAGGTACCGG
TACCTGATGTTTCTACTGCTACTATTTTACGTACCGTTGCTTCCATGGCC
M  D  Y  K  D  D  D  D  K  M  H  G  N  E  G  T  G
   <   FLAG-peptide tag   >

        60        70        80        90       100
ATCCGGAGGAGGATATCCGTATGATGTGCCGGATTATGCGGGAGGAGGATCCTAA
TAGGCCTCCTCCTATAGGCATACTACACGGCCTAATACGCCCTCCTCCTAGGATT
 S  G  G  G  Y  P  Y  D  V  P  D  Y  A  G  G  G  S
             <     HA-peptide tag      >

The competitor construct in the selection process is the E. coli folA gene encoding
dihydrofolate reductase, amplified from pET23a/folA using primers pETfor and pETrev.b.

PCR fragments were gel-purified using the QIAquick Gel Extraction kit (Qiagen). DNA
concentration was measured by UV spectrophotometry. Dilutions of PCR-prepared
expression constructs were made in 0.5 mg/ml carrier DNA prepared from HindIII
digested lambda phage DNA (40 min at 80°C, followed by ethanol precipitation and
dissolution in water).

2 x 10^9 streptavidin-coated 0.95 µm polystyrene beads in a 100 µl aliquot of 1%
suspension (Bangs Laboratories, Inc. CP01N) were spun in a microfuge at approximately
2,600 g (6,000 rpm) for 3 minutes. The supernatant was removed and the beads
resuspended in 100 µL 0.1 M Tris-HCl, pH 7.5, 0.15 M NaCl, 0.05% Tween-20, 0.5%
BSA (TNTB). 7 µl of 2 mg/ml biotinylated anti-FLAG monoclonal antibody M5 (Sigma)
was added to the resuspended beads and the mix was incubated at room temperature for
two hours. Following coating with the antibody, the beads were washed three times
with 200 µl TNTB, resuspended in 100 µl TNTB and split into 10 µl aliquots 1 and 2 and
40 µl aliquots 3 and 4. 0.7 nM stock solutions of either (#1) pure FLAG-HA DNA, (#2)
pure folA DNA, or (#3 and #4) pure FLAG-HA DNA diluted in a 1000-fold excess of
folA DNA were prepared in HindIII-digested lambda DNA and applied to the bead
aliquots. The binding reaction was allowed to proceed overnight at 4°C. The maximum
number of genes per bead was 2 in aliquots 1-3 and 0.2 in aliquot 4. The beads coated
with the FLAG-HA construct served as positive control and the beads coated with folA as
negative control.
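
How many beads carry exactly one gene at these loadings can be estimated if gene attachment is treated as a random (Poisson) process. This is an assumption made here for illustration; the text itself only states the maximum average number of genes per bead. A minimal sketch in Python, with illustrative function names:

```python
import math


def poisson_pmf(k: int, mean: float) -> float:
    """Probability that a bead carries exactly k genes at the given mean loading."""
    return math.exp(-mean) * mean ** k / math.factorial(k)


def single_gene_fraction(mean: float) -> float:
    """Among beads carrying at least one gene, the fraction carrying exactly one."""
    occupied = 1.0 - poisson_pmf(0, mean)
    return poisson_pmf(1, mean) / occupied


# Mean loadings quoted in the text: 2 genes/bead (aliquots 1-3), 0.2 genes/bead (aliquot 4).
for mean in (2.0, 0.2):
    print(f"mean {mean}: empty beads {poisson_pmf(0, mean):.3f}, "
          f"single-gene among occupied {single_gene_fraction(mean):.3f}")
```

Under this model, lowering the mean loading from 2 to 0.2 leaves most beads empty but makes the large majority of occupied beads carry a single gene, which is the condition needed for an unambiguous link between a gene and its product.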



#   DNA       Ratio folA:   Beads      DNA    DNA    Molecules of   S30    Emulsion
              FLAG-HA                  (nM)   (µl)   DNA/bead       (µl)   (ml)

1   FLAG-HA                 2 x 10^8   0.7    1      2              25     0.5
2   folA                    2 x 10^8   0.7    1      2              25     0.5
3   folA:HA   1000:1        8 x 10^8   0.7    4      2              50     2 x 0.5
4   folA:HA   1000:1        8 x 10^8   0.7    0.4    0.2            50     2 x 0.5
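
The "molecules of DNA per bead" column can be cross-checked from the DNA concentration, the volume applied and the number of beads. A minimal sketch in Python, assuming bead counts of 2 x 10^8 (samples 1 and 2) and 8 x 10^8 (samples 3 and 4) and assuming that every DNA molecule binds a bead, so the result is an upper limit; the function name is illustrative.

```python
AVOGADRO = 6.02214076e23  # molecules per mole


def dna_molecules_per_bead(conc_nM: float, volume_ul: float, n_beads: float) -> float:
    """Upper limit of DNA molecules per bead if all applied DNA binds."""
    moles = (conc_nM * 1e-9) * (volume_ul * 1e-6)  # mol/L multiplied by L
    return moles * AVOGADRO / n_beads


# (DNA concentration in nM, DNA volume in µl, assumed number of beads)
samples = {1: (0.7, 1.0, 2e8), 3: (0.7, 4.0, 8e8), 4: (0.7, 0.4, 8e8)}
for number, (conc, volume, beads) in samples.items():
    ratio = dna_molecules_per_bead(conc, volume, beads)
    print(f"sample #{number}: {ratio:.2f} DNA molecules per bead")
```

With these assumed bead counts the calculation gives roughly 2 molecules per bead for samples 1 to 3 and roughly 0.2 for sample 4, in line with the tabulated values.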



After overnight incubation at 4°C, the beads were washed twice in TNTB and
resuspended in S30 in vitro translation mixture (S30 Extract System for Linear Templates,
Promega) supplemented with T7 RNA polymerase (20 units/µl).

The ice-cooled in vitro translation reactions were added gradually (in 5 aliquots of 10 µl
over ~2 minutes) to 0.5 ml of ice-cooled oil phase (freshly prepared by dissolving 4.5%
(v/v) Span 80 (Fluka) in mineral oil (Sigma, #M-5904) followed by 0.5% (v/v) Tween 80
(SigmaUltra; #P-8074)) in a 5 ml Costar Biofreeze Vial (#2051) whilst stirring with a
magnetic bar (8x3 mm with a pivot ring; Scientific Industries International, Loughborough,
UK). Stirring (at 1150 rpm) was continued for an additional 3 minutes on ice. Reactions
were then incubated for 90 min at 30°C.

The emulsions were transferred to 1.5 ml microfuge tubes, spun for 8 min at 6.5k rpm in a
microfuge and the oil phase removed, leaving the concentrated (but still intact) emulsion at
the bottom of the tube. 200 µl 0.1 M Tris-HCl, pH 7.5, 0.15 M NaCl, 0.05% Tween-20
(TNT) were added and the emulsion broken by extracting 4 times with 1 ml hexane,
vortexing between each hexane addition. Residual hexane was removed by bubbling air
through the suspension of beads for 1-2 min at ambient temperature.

Beads from the broken emulsions were then washed twice with 100 µl TNT in a 0.45 µm
MultiScreen-HV filter plate (Millipore, MAHVN4510), thoroughly resuspending between
each wash. Beads were then resuspended in TNTB at 10^ beads/µl containing 100
mU/ml rat anti-HA-Peroxidase, High Affinity (3F10) conjugate (Boehringer Mannheim).

The beads were incubated with the antibody for 30 minutes at ambient temperature and
washed three times with 200 µl TNT before being resuspended in 2 ml of 0.2 M Tris, 10
mM imidazole, pH 8.8. The suspended beads were sonicated for 1 min on ice using a Heat
Systems sonicator at power 1, 95% cycle, 3.4 mm tip. The sonicated beads were
resuspended at 10^ beads/ml in 0.2 M Tris, 10 mM imidazole, pH 8.8. To this suspension
of beads an equal volume of tyramine signal amplification (TSA) buffer (0.2 M Tris, 10
mM imidazole, pH 8.8, 0.004% H2O2, 5 ng/ml fluorescein tyramine) was added.

Fluorescein tyramine was synthesised as described by Hopman et al. (Anthon H.N.
Hopman, Frans C.S. Ramaekers, Ernst J.M. Speel, The Journal of Histochemistry and
Cytochemistry vol 46(6), 771-777, 1998).

The reaction was left to proceed for five minutes at room temperature and stopped by
addition of 1/10th volume of 10% bovine serum albumin in PBS (BSA, Sigma). The
beads were spun down in 2 ml aliquots of the labelling reaction and washed 2 times in
TNTB and once in PBS. Finally the beads were resuspended in 2 ml of PBS and sonicated
as above.

The beads coated with genes encoding folA, FLAG-HA or a 1000-fold dilution of
FLAG-HA in folA were analysed on a Becton Dickinson FACScan flow cytometer.

In Figure 18, low resolution histogram A demonstrates that the beads carrying FLAG-HA
DNA (sample #1) are significantly more fluorescently labelled than the negative control
folA (sample #2). The spiked mixtures #3 and #4 run predominantly identically to the
negative control sample except for a small number of highly fluorescent beads (panel B).
0.04% of beads in sample #3 and 0.02% of beads in sample #4 fell into the region M1
that covers 95% of positive events.

The beads in samples #3 and #4 that fell into region M1 were sorted using a MoFlo
fluorescence-activated cell sorter. Two sets of sorted beads were acquired for both
samples #3 and #4. In set one, 500 beads were collected into a single tube. In set two, 96
beads were collected individually into the wells of a 96-well plate. Both sets of beads
were subjected to 35-cycle PCR using primers pETrev.b and FLAGrev1.

The amplification products were analysed by gel electrophoresis (Figure 19). The product
sizes are 903 bases for FLAG-HA and 1390 bp for folA.

The gel electrophoretic analysis of the amplification reaction products suggests
significant enrichment during the course of sorting. In panel A there are no FLAG-HA
bands visible in the lanes of the products amplified from unsorted reactions #3 and #4,
whereas the FLAG-HA band in the samples from the sorted beads is strongly visible.
Definitive data regarding the nature of the amplified DNA were obtained from the
analysis of DNA amplified from single beads. In total 22 beads out of 96 yielded a DNA
product for reaction #3 and 50% of these were pure FLAG-HA. For reaction #4, 9 beads
yielded products and 8 were FLAG-HA.

Single-bead data for reaction #3 suggest that at the concentration applied, nominally 2
DNA molecules/bead, most of the beads in fact have only one gene attached, allowing
unambiguous linkage between the gene and its product. The relatively high number of
positively labelled beads meant, however, that about 50% of the beads recovered were false
positives. In sample #4, where there were only ≈0.1 genes/bead, the purity of the
recovered DNA approached 90%, indicating nearly 1000-fold enrichment in one step.
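
The enrichment figure can be reproduced from the single-bead counts. A minimal sketch in Python, assuming enrichment is defined as the fraction of FLAG-HA genes after sorting divided by their fraction before sorting (1 in 1001 for a 1000-fold excess of folA); the function name is illustrative, and the count of 11 FLAG-HA beads for reaction #3 is inferred from the stated 50% of 22 beads.

```python
def enrichment_factor(competitor_excess: float, n_target: int, n_total: int) -> float:
    """Ratio of the target fraction after sorting to the target fraction before sorting."""
    if n_total <= 0:
        raise ValueError("at least one recovered bead is required")
    initial_fraction = 1.0 / (1.0 + competitor_excess)
    final_fraction = n_target / n_total
    return final_fraction / initial_fraction


# Reaction #4: 8 of 9 single-bead products were FLAG-HA.
print(f"reaction #4: {enrichment_factor(1000, 8, 9):.0f}-fold")
# Reaction #3: 50% of 22 products were FLAG-HA (11 beads, inferred).
print(f"reaction #3: {enrichment_factor(1000, 11, 22):.0f}-fold")
```

On this definition reaction #4 gives an enrichment of just under 900-fold, consistent with the "nearly 1000-fold" stated above, while the false positives in reaction #3 reduce it to about 500-fold.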

Oligonucleotides 

EDHFR-Fo        5'-CGA GCT AGA GGT ACC TTA TTA CCG CCG CTC CAG AAT CTC AAA
                GCA ATA G-3'
EDHFR-Ba        5'-GCA TCT GAC AAG CTT AAT AAT TTT GTT TAA CTT TAA GAA GGA
                GAT ATA CAT ATG ATC AGT CTG ATT GCG GCG TTA GCG GTA G-3'
LMB2-Biotin     5'-Biotin-GTA AAA CGA CGG CCA GT-3'
folA-FW         5'-GCG CGA AGC TTC GAT CAG TCT GAT TGC GGC G-3'
folA-BW         5'-GCG CCT CGA GTT CCG CCG CTC CAG AAT CTC-3'
pETfor.b        5'-Biotin-GAC TCC AAC GTC AAA GGG CG-3'
pETrev.b        5'-Biotin-GGT TTT CAC CGT CAT CAC CG-3'
GFP-FW          5'-GCG CGA AGC T TCG AGT AAA GGA GAA GAA CTT TTC-3'
GFP-BW          5'-GCG CCT CGA GTT TTG TAT AGT TCA TCC ATG CCA TG-3'
GSTM2-2Fo       5'-TGA TGC CGG TAG CTT ATT ACT TGT TGC CCC AGA CAG CC-3'
GSTM2-2Ba       5'-AGT TAA GTC TAA GCT TAA TAA TTT TGT TTA ACT TTA AGA AGG
                AGA TAT ACA TAT GCC CAT GAC ACT GGG GTA C-3'
LMB2            5'-GTA AAA CGA CGG CCA GT-3'
LMB3            5'-CAG GAA ACA GCT ATG AC-3'
N-Flag-OPD-Fo   5'-TCG ATA CGT CGG TAC CTT ATT ATG ACG CCC GCA AGG TCG
                GTG-3'
N-Flag-OPD-Bc   5'-CAT TGC CAA GCC ATG GAC TAC AAA GAT GAC GAT GAT AAA ATC
                ACC AAC AGC GGC GAT CGG ATC AAT ACC G-3'

5'-CTA GGT CAT GGA TCC ATG AAA CTG AAG GTA ACA GTC AAC GGC-3'

5'-CAG ATA GCT AAG CTT TTA TTA TTC GAT GAG CTC GAG ATC CCC-3'

5'-CAT CGA AGG TGG CAG CTC TGC-3'

5'-GGC CGC AGA GCT GCC ACC TTC GAT GAG CT-3'

5'-ATC GTA GCA CTC GAG CAT GAA GGA TAA CAC CGT GCC A-3'

5'-GTC ATG ACT GGT ACC TTA TTA TTT TTC TGC ACT ACG CAG-3'

5'-CAT GGA CTA CAA AGA TGA CGA TGA TAA AAT GCA TGG CAA CGA
AGG TAC CG-3'

5'-GAT CCG GTA CCT TCG TTG CAT GCA TTT TAT CAT CGT CAT CTT
TGT AGT C-3'

5'-AAC TCA GCT TCC TTT CGG GCT TTG TTA GGA TCC TCC TCC CGC
ATA ATC CGG CAC ATC ATA CGG ATA TCC TCC TCC GGA TCC GGT ACC
TTC GTT GCC-3'

5'-biotin-GGT TTT CAC CGT CAT CAC CG-3'
5'-GAC TCC AAC GTC AAA GGG CG-3'
5'-AAC TCA GCT TCC TTT CGG GC-3'

1. A method for isolating one or more genetic elements encoding a gene product
having a desired activity the expression of which may result, directly or indirectly, in the
modification of an optical property of a genetic element encoding the gene product,
comprising the steps of:

(a) compartmentalising genetic elements into microcapsules;

(b) expressing the genetic elements to produce their respective gene products within
the microcapsules;

(c) sorting the genetic elements which produce the gene product(s) having the desired
activity according to a change in the optical properties of the genetic elements.

2. A method according to claim 1, wherein in step (b) the activity of the desired gene
product within the microcapsule results, directly or indirectly, in the modification of the
genetic element encoding the gene product to enable the isolation of the genetic element.

3. A method according to claim 2, wherein the modification of the genetic element
within the microcapsule induces a change in its optical properties.

4. A method according to claim 2, wherein the modification of the genetic element
enables it to be further modified outside the microcapsule so as to induce a change in its
optical properties.

5. A method according to claim 2, wherein a part of the genetic element is a ligand
and the desired gene product within the microcapsule binds, directly or indirectly, to said
ligand to enable the isolation of the genetic element.

6. A method according to claim 5, wherein the ligand is also encoded by tiie genetic 
element. 

30 

7. A method according to claim 2, wherein a part of the genetic element is a substrate 
and the activity of the desired gene product within the microcapsule results, directiy or 
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indirectly, in the conversion of said substrate into a product which remains part of the 
genetic element and enables its isolation. 

8. A method according to claim 2, wherein the product of the activity of the desired
gene product within the microcapsule results, directly or indirectly, in the generation of a
product which is subsequently complexed with the genetic element and enables its
isolation.

9. A method according to any preceding claim wherein the activity of the desired
gene product within the microcapsule results, directly or indirectly, in the alteration of the
expression of a second gene within the compartment and the activity of the product of the
said second gene enables the isolation of the genetic element using a change in the optical
properties of the genetic element.

10. A method according to claim 1, wherein step (b) comprises:

expressing the genetic elements to produce their respective gene products within the
microcapsules, linking the gene products to the genetic elements encoding them and
isolating the complexes thereby formed.

11. A method according to claim 10, wherein in step (c) the complexes are directly
sorted based on their changed optical properties to isolate genetic elements encoding a
gene product having the desired activity.

12. A method according to claim 10, wherein in step (c) the complexes are further
reacted to induce a conditional change in optical properties of the genetic element
dependent on the presence of gene products with the desired activity in the complex.

13. A method according to claim 10, wherein the complexes are subjected to a further
compartmentalisation step in order to isolate the genetic elements encoding a gene
product having the desired activity.
14. A method according to claim 1, wherein the change in optical properties of the
genetic element is due to binding of a gene product with distinctive optical properties to
the genetic element.

15. A method according to claim 1, wherein the change in optical properties of the
genetic element is due to binding of a ligand with distinctive optical properties by the
gene product.

16. A method according to claim 1, wherein the change in optical properties of the
genetic element is due to a change in the optical properties of the gene product when
bound to ligand.

17. A method according to claim 1, wherein the change in optical properties of the
genetic element is due to a change in the optical properties of the ligand when bound by
the gene product.

18. A method according to claim 1, wherein the change in optical properties of the
genetic element is due to a change in the optical properties of both ligand and gene
product on binding.

19. A method according to claim 1, wherein the change in optical properties of the
genetic element is due to the different optical properties of the substrate and the product
of the reaction being selected.

20. A method according to claim 1, wherein both substrate and product have similar
optical properties, but only the product, and not the substrate of the reaction being
selected binds to, or reacts with, the genetic element, thereby changing the optical
properties of the genetic element.

21. A method according to claim 1, wherein further reagents specifically bind to, or
specifically react with, the product (and not the substrate) attached to the genetic element,
thereby altering the optical properties of the genetic element.
22. A method according to any preceding claim, wherein a non-desired activity of a
gene product results in a change in the optical properties of the genetic element which is
distinct from that resulting from the desired activity.

23. A method according to claim 22, wherein the optical change resulting from the
non-desired activity is used to negatively select the genetic elements.

24. A method according to claim 22, wherein negative selection is combined with
positive selection to improve reaction specificity.

25. A method according to claim 24, wherein the improved reaction specificity is an
improvement in binding specificity.

26. A method according to claim 24, wherein the improved reaction specificity is an
improvement in regio- and/or stereo-selectivity for substrate and/or product.

27. A method according to any preceding claim, wherein the genetic elements are
isolated from a library of genetic elements encoding a repertoire of gene products.

28. A method according to any preceding claim, wherein each genetic element
encodes two or more genes and each gene product must have a desired activity in
order for the optical properties of the genetic element to be modified to enable them to be
sorted.

29. A method according to any preceding claim, wherein each genetic element
encodes two or more genes and the gene products must bind to each other in the
microcapsule in order for the optical properties of the genetic element to be modified and
the genetic elements sorted.

30. A method according to any preceding claim further comprising the additional step
of:
(d) introducing one or more mutations into the genetic element(s) isolated in step (c).

31. A method according to any preceding claim further comprising iteratively
repeating one or more of steps (a) to (d).

32. A method according to any preceding claim further comprising amplifying the
genetic elements.

33. A method according to claim 1, wherein microencapsulation is achieved by
forming a water-in-oil emulsion of the aqueous solution in an oil-based medium.

34. A method according to claim 1, wherein the genetic element comprises the gene
attached to a microbead.

35. A method according to claim 1, wherein the microbead is nonmagnetic, magnetic
or paramagnetic.

36. A method according to claim 1, wherein the genetic elements are sorted by
detection of a change in their fluorescence.

37. A method according to claim 36, wherein the sorting of genetic elements is
performed using a fluorescence activated cell sorter (FACS) (or similar device).

38. A method according to claim 36, wherein the different fluorescence properties of
the substrate and the product are due to fluorescence resonance energy transfer (FRET).

39. A method according to any preceding claim, wherein the internal environment of
the microcapsules is modified by the addition of one or more reagents to the oil phase.

40. A method according to any preceding claim, wherein genetic elements modified directly
or indirectly by the activity of the desired gene product are further modified by Tyramide Signal
Amplification (TSA™; NEN), resulting directly or indirectly in a change in the optical properties
of said genetic elements thereby enabling their separation.
41. A product when isolated according to the method of any preceding claim.

42. A method for preparing a gene product, comprising the steps of:

(a) preparing a genetic element encoding the gene product;

(b) compartmentalising genetic elements into microcapsules;

(c) expressing the genetic elements to produce their respective gene products within the
microcapsules;

(d) sorting the genetic elements which produce the gene product(s) having the desired
activity using a change in their optical properties; and

(e) expressing the gene product having the desired activity.

43. A method for screening a compound or compounds capable of modulating the
activity of a gene product, comprising the steps of:

(a) preparing a repertoire of genetic elements encoding gene product;

(b) compartmentalising the genetic elements into microcapsules;

(c) expressing the genetic elements to produce their respective gene products within the
microcapsules;

(d) sorting the genetic elements which produce the gene product(s) having the desired
activity using a change in their optical properties; and

(e) contacting a gene product having the desired activity with the compound or
compounds and monitoring the modulation of an activity of the gene product by the
compound or compounds.

44. A method for preparing a compound or compounds comprising the steps of:

(a) providing a synthesis protocol wherein at least one step is facilitated by a
polypeptide;

(b) preparing genetic elements encoding variants of the polypeptide which facilitates
this step;

(c) compartmentalising the genetic elements into microcapsules;

(d) expressing the genetic elements to produce their respective gene products within
the microcapsules;

(e) sorting the genetic elements which produce polypeptide gene product(s) having the
desired activity using a change in their optical properties; and

(f) preparing the compound or compounds using the polypeptide gene product
identified in (e) to facilitate the relevant step of the synthesis.
WO 00/40712 (PCT/GB00/00030)

FIGURES 6, 7, 10, 11 and 13 to 19 [drawings not reproduced; Figure 6 is a chemical reaction scheme whose labels did not survive extraction]

Selection System

The present invention relates to a screening system useful for screening repertoires of
DNA binding domains. In particular the invention relates to a screening system based on
transcriptional activators of bacterial σ54-dependent promoters.



The majority of proteins involved in cellular functions do so by interacting with other
proteins or nucleic acid sequences within the cell. Several approaches have been described
that allow the in vivo selection of nucleic acids which express polypeptides capable of
binding to proteins or DNA in the cell. Arguably the most powerful approaches are the
yeast one- and two-hybrid systems (Fields S. & Song O. (1989) Nature 340, 245; see US
Patent 5,283,173, incorporated herein by reference in its entirety) for the screening of
protein-DNA and protein-protein interactions, respectively. However, the two-hybrid
system requires a eukaryotic host and consequently the diversity that can be screened is
limited. Furthermore, the system notoriously suffers from an abundance of false positives.

Larger molecular repertoires can be prepared in bacterial hosts and a number of bacterial
systems for the screening of protein-protein and protein-DNA interactions have also been
reported. Two systems have been put forward in which the polypeptide chain of an enzyme
is expressed in two parts fused to two candidate polypeptides, and in which interaction
between the candidate polypeptides reconstitutes the function of the enzyme (Karimova G.
et al (1998) Proc. Nat. Acad. Sci USA 95, 5752; Pelletier J.N. et al (1998) Proc. Nat.
Acad. Sci USA 95, 12141).

Several in vivo screens for DNA-binding proteins have also been reported (reviewed in
Mossing M.C., Bowie J.U. & Sauer R.T. (1991) Methods Enzymol. 208, 604; Elledge S.J.
et al (1989) Proc. Nat. Acad Sci USA 86, 3689). Each of these methods involves the
blockage of a hybrid σ70 promoter by the DNA binding protein. Repression of the
promoter either prevents the production of a conditionally toxic gene or alleviates repression
of an antibiotic gene by transcriptional interference. The transcriptional interference assay
(Elledge et al.) has been used successfully in one case to select DNA binding proteins with
altered specificity (Sera T. & Schultz P.G. (1996) Proc. Nat. Acad Sci USA 93, 2920).



WO 01/18244 (PCT/GB00/03450)

Another σ70-based system utilises recruitment of the polymerase to the promoter by way
of a protein-protein interaction between a protein domain fused to the RNA polymerase
α subunit and another fused to the lambda repressor bound immediately upstream of the
RNA polymerase promoter binding site (Dove S.L., Joung J.K., Hochschild A. (1997),
Nature 386, 627). By replacing the lambda repressor DNA binding domain with a library
of Zn-finger domains, specific DNA binding Zn-finger domains were selected (Joung J.K.,
Ramm E.I., Pabo C.O. (2000) Proc Natl Acad Sci USA 97, 7382).

The alternative holoenzyme form of bacterial RNA polymerase (RNAP) contains the σ54
factor (σ54-RNAP). As has been previously shown, this polymerase, in most cases, forms
a closed complex with the promoter. Unlike σ70 promoters, at which the RNA polymerase
is bound in an active form and is largely controlled by repression, the σ54 RNA
polymerase holoenzyme is transcriptionally incompetent and is unable to initiate
transcription by itself. Initiation of transcription requires the presence of a transcriptional
activator that catalyses the isomerisation of the closed promoter complex to an open one.
Typically, activator proteins bind to a specific upstream activation sequence (UAS) located
80 to 200 bp upstream of the σ54 core promoter. The function of the UAS is to tether the
activator in the right position and to bring it in the vicinity of the promoter in order to
increase the efficiency of interaction between the σ54 RNAP and the activator.
Transcriptional activators of σ54-dependent promoters have been called bacterial
enhancers because their mechanism of activation is superficially similar to the activation of
transcription by enhancer proteins in eukaryotes (Kustu S. et al (1991) Trends Biochem Sci
16, 397).

Conversion of the σ54 RNAP into an active form is catalysed by the binding of an
enhancer protein coupled to hydrolysis of ATP. This unusual mechanism accounts for the 
low level of background transcription and the enormous difference (lOUO^) between on 
and off states in the strongest σ54 promoters effected by a single factor. In comparison,
activators of σ70 promoters such as CAP or λcI increase transcription levels usually by
less than 10-fold.
Transcriptional activators of σ54 promoters (also known as enhancer-binding proteins or
EBPs) share a common structure (see Morrett and Segovia (1993) J. Bacteriol. 6067-6074)
comprising a non-conserved N-terminal domain which has a putative regulatory
function, a central domain which is responsible for transcriptional activation, and a
C-terminal DNA binding domain which binds the relevant UAS in the target gene. The
domains are modular: the central and N-terminal domains together are capable of
constitutive activation of σ54 RNAP when overexpressed. At least in some cases, the
isolated DNA binding domain is capable of specifically binding its DNA recognition site.

In many instances, interaction between σ54 RNAP and the activator is enhanced by a
cellular factor which promotes DNA bending between the UAS and the σ54 promoter
(Freundlich et al., (1992) Mol. Microbiol. 6:2557-2563). This factor, known as integration
host factor (IHF), acts to promote transcription from σ54 promoters.

Summary of the Invention

We provide herein a novel screening system which is based on transcriptional activators of
σ54-based promoters.

According to a first aspect of the invention, therefore, there is provided a method for
detecting a protein-nucleic acid interaction between a nucleic acid molecule and a protein
molecule, comprising the steps of:

a) providing one or more hybrid σ54 activator proteins comprising a heterologous
nucleic acid binding sequence and a constitutively active σ54 transcription activating
domain;

b) providing one or more nucleic acid molecules comprising a binding site for the
nucleic acid binding sequence and a binding site for σ54 RNAP, which directs the
expression of a reporter gene and leads to upregulation thereof in response to activation by
the σ54 transcription activating domain; and

c) detecting expression of the reporter gene.
The invention provides a reporter system which is characterised by very low levels of
background expression, since the σ54 polymerase is transcriptionally incompetent in the
absence of a σ54 transcriptional activator. Since, at physiological concentrations, the
binding of the transcriptional activator to the nucleic acid is required in order to activate
transcription by σ54 RNAP, the system of the invention may be used as a tool for
investigating and/or screening protein/nucleic acid interactions exploiting the reporter gene
read-out.

In the first aspect of the invention, either the nucleic acid binding protein or the nucleic
acid molecule may be provided in the form of a repertoire of molecules. Repertoires of
hybrid σ54 activator proteins preferably are partially or completely randomised at least in
the heterologous nucleic acid binding sequence. This allows selection from the library of
molecules having desired nucleic acid binding characteristics.

Repertoires of nucleic acid molecules advantageously are partially or completely
randomised in the binding site for the nucleic acid binding sequence of the σ54 activator
protein. This allows selection of nucleic acid molecules having desired binding sites for
the chimeric activators.

In a second aspect of the invention, there is provided a system for selecting protein-protein
interactions based on the constitutively active hybrid σ54 activators described above. The
system according to the invention is conceptually similar to the yeast two-hybrid system.

Accordingly, there is provided a method for detecting a protein-protein interaction,
comprising the steps of:

a) providing a first hybrid protein comprising a nucleic acid binding sequence and a
first polypeptide sequence bait;

b) providing a second hybrid protein comprising a prey polypeptide sequence and a
constitutively active σ54 transcription activating domain;

c) providing a nucleic acid molecule comprising a binding site for the nucleic acid
binding sequence and a binding site for σ54 RNAP which directs the expression of a
reporter gene and leads to upregulation thereof in response to activation by the σ54
transcription activating domain;

d) incubating the first and second hybrid proteins together with the nucleic acid
molecule such that the prey and bait polypeptide sequences may bind, thereby forming a
hybrid protein comprising both a nucleic acid binding sequence and a σ54 transcription
activating domain; and

e) detecting expression of the reporter gene.

As will be apparent to those skilled in the art, reference to a "binding site" for the nucleic 
acid binding sequence includes the provision of several appropriately spaced binding sites 
in the nucleic acid molecule. 

As with the yeast two-hybrid system, in which a modular transcription factor is assembled
through binding of DNA binding domain/bait and transcription activating domain/prey
hybrids, the association of the nucleic acid binding sequence and the σ54 transcription
activating domain through the bait/prey interaction allows the detection of, and screening
for, protein-protein binding interactions in vivo and in vitro. Advantageously, the bait
and/or prey polypeptide sequences are provided in the form of repertoires, which may be
partially or completely randomised. This allows selection of prey polypeptides based on
their ability to form interactions with a desired bait (or vice versa). As the assay may be
conducted in vivo, in a bacterium, the invention permits the detection of in vivo binding
interactions between polypeptides in bacteria.
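
The logic of the bait/prey read-out can be illustrated with a small toy model in Python. It is only a sketch under stated assumptions: the names `Hybrid`, `reporter_on` and `screen`, the site label and the polypeptide names are hypothetical, and binding is reduced to a yes/no lookup rather than anything quantitative.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, List, Optional, Set


@dataclass(frozen=True)
class Hybrid:
    """A hybrid protein reduced to the features the assay depends on (hypothetical model)."""
    polypeptide: str                  # bait or prey polypeptide carried by the hybrid
    binds_site: Optional[str] = None  # DNA site recognised by its nucleic acid binding sequence
    activates: bool = False           # True if it carries a sigma-54 transcription activating domain


def reporter_on(bait: Hybrid, prey: Hybrid, reporter_site: str,
                interacting_pairs: Set[FrozenSet[str]]) -> bool:
    """Reporter is expressed only if the bait hybrid is tethered at the reporter's
    binding site, the bait and prey polypeptides bind each other, and the prey
    hybrid carries the activating domain."""
    bait_tethered = bait.binds_site == reporter_site
    pair_binds = frozenset((bait.polypeptide, prey.polypeptide)) in interacting_pairs
    return bait_tethered and pair_binds and prey.activates


def screen(bait: Hybrid, preys: Iterable[Hybrid], reporter_site: str,
           interacting_pairs: Set[FrozenSet[str]]) -> List[str]:
    """Return the prey polypeptides from a repertoire that switch the reporter on."""
    return [p.polypeptide for p in preys
            if reporter_on(bait, p, reporter_site, interacting_pairs)]


# Hypothetical repertoire: only preyA binds baitX, and one preyA hybrid lacks the activating domain.
pairs = {frozenset(("baitX", "preyA"))}
bait = Hybrid("baitX", binds_site="site1")
library = [Hybrid("preyA", activates=True), Hybrid("preyB", activates=True), Hybrid("preyA")]
print(screen(bait, library, "site1", pairs))
```

In this simplified picture a prey is recovered only when all three conditions hold together, which is why a non-interacting prey, or a reporter carrying a different binding site, gives no signal.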

It will be apparent that the hybrid proteins useful in the methods of the invention are
advantageously provided in the form of nucleic acid vectors or libraries thereof capable of
expressing said proteins in a host bacterium. Advantageously, the vector(s) include first
and second chimeric genes which encode the hybrid proteins of the invention. Preferably,
the vectors also include means for replication in bacteria. Also included may be one or
more marker genes, the expression of which in the bacterium permits selection of cells
containing the vector(s) from cells that do not contain the vector(s). Preferably, the
vector(s) are plasmid(s).
In a third aspect, the invention provides a method for screening a repertoire of
candidate DNA-bending polypeptides, comprising the steps of:

a) providing a repertoire of candidate polypeptide factors with potential to induce
bending of DNA;

b) providing a σ54 activator protein comprising a nucleic acid binding sequence
and a σ54 transcription activating domain;

c) providing a nucleic acid molecule comprising a binding site for the nucleic acid
binding sequence and a binding site for σ54 RNAP which directs the expression of a
reporter gene and leads to upregulation thereof in response to activation by the σ54
transcription activating domain;

d) incubating the repertoire and σ54 activator together with the nucleic acid
molecule in an IHF⁻ host cell, such that the σ54 activator and the nucleic acid molecule may
interact, and transcription is activated from the σ54 RNAP binding site in a manner
dependent on DNA bending by the polypeptide factors; and

e) detecting expression of the reporter gene.

It is known that activation by σ54 activators may be regulated by factors which induce
DNA bending in the target gene. For example, the host factor IHF is known to potentiate
σ54 activation; moreover, it may be replaced by alternative DNA bending polypeptides, or
by intrinsically bent DNA.

The invention moreover provides methods for development of improved σ54 activator-based
tools.

The first chimeric gene includes a nucleic acid sequence that encodes a nucleic acid-binding
domain and a first (bait) test protein or protein fragment in such a manner that the first test
protein is expressed as part of a hybrid protein with the nucleic acid-binding domain.

The second chimeric gene also includes a promoter and a transcription termination signal
to direct transcription. The second chimeric gene moreover includes a nucleic acid
sequence that encodes a σ54 transcriptional activation domain and a second (prey) test
protein or protein fragment into the vector, in such a manner that the second test protein is
capable of being expressed as part of a hybrid protein with the transcriptional activation
domain.

The invention moreover provides kits for practising the invention, which kits 
advantageously comprise a container, two vectors, and a host cell. The first vector contains 
a promoter and may include a transcription termination signal functionally associated with 
the first chimeric gene in order to direct the transcription of the first chimeric gene. The 
chimeric gene advantageously comprises one or more unique restriction site(s) to insert a 
nucleic acid sequence encoding a test bait polypeptide. The kit may also include a
second vector which contains a second chimeric gene, optionally comprising one or more 
unique restriction site(s) to insert a nucleic acid sequence encoding the prey polypeptide; 
alternatively, the second chimeric gene may be present on the same vector as the first 
chimeric gene. 

Brief description of the Figures 

Figure 1A is a schematic representation of σ54 RNAP activation by σ54 activator NifA.

Figure 1B is a schematic representation of the invention, in which the σ54 DNA binding
domain is replaced with a heterologous GCN4 DNA binding domain.

Figure 2 is a schematic representation of the first aspect of the present invention, in which
a library of DNA binding domains is screened together with a library of DNA binding
domain binding sites to identify protein:DNA binding pairs.

Figure 3 shows the activation of transcription by NifA-chimera, expressed as percent of
wt activity (NifA/UAS). Nif-GCN4 (in the presence of the NifAΔC coactivator (NifADC))
shows close to wt activity. Equal activity is observed for the two distinct GCN4 DNA
recognition sites (ATF/Creb and AP-1). Less than 1% wt activity is observed with a
non-cognate reporter such as one bearing the wt nifH UAS.
Figure 4 shows the activation of transcription by NifA-chimera, expressed as percent of
wt activity (NifA/UAS). Nif-ERDBD (in the presence of the NifAΔC coactivator (NifADC))
shows ca. 80% of wt activity. Very little activation is observed with a non-cognate reporter
bearing the DNA recognition site (GRE) for the closely related glucocorticoid receptor.

Figure 5 shows the coactivation by different NifA variants, expressed as percent of wt
activity (NifAwt). NifA from Klebsiella pneumoniae (NifAKp) is superior to all others,
even exceeding wt activity (up to 160%). NifAKp with its DNA domain deleted
(NifAΔCKp (NifADCKp)) is almost as active.
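
The percentages quoted for Figures 3 to 5 are reporter activities normalised to the wild-type construct. Purely as an illustration of that arithmetic, the normalisation might be sketched in Python as below; the function name `percent_of_wt`, the optional background subtraction and the numerical readings are assumptions made for this example and are not data from the experiments.

```python
def percent_of_wt(sample_activity: float, wt_activity: float,
                  background: float = 0.0) -> float:
    """Express a reporter reading as a percentage of the wild-type reading.

    All readings must be in the same (arbitrary) units. The optional
    background is subtracted from both readings before the ratio is taken.
    """
    corrected_wt = wt_activity - background
    if corrected_wt <= 0:
        raise ValueError("wild-type activity must exceed background")
    return 100.0 * (sample_activity - background) / corrected_wt


# Hypothetical readings, chosen only to illustrate the calculation.
readings = {"chimera on cognate reporter": 800.0,
            "chimera on non-cognate reporter": 8.0}
for name, value in readings.items():
    print(f"{name}: {percent_of_wt(value, wt_activity=1000.0):.1f}% of wt")
```

Normalising to the wild-type reading measured in the same experiment makes constructs comparable across reporters, which is how a value above 100% (as reported for NifAKp) can arise.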



Detailed Description of the Invention

Unless defined otherwise, all technical and scientific terms used herein have the same
meaning as commonly understood by one of ordinary skill in the art (e.g., in bacterial cell
culture, molecular genetics, nucleic acid chemistry, protein chemistry and biochemistry).
Standard techniques are used for molecular, genetic and biochemical methods (see
generally, Sambrook et al., Molecular Cloning: A Laboratory Manual, 2d ed. (1989) Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. and Ausubel et al., Short
Protocols in Molecular Biology (1999) 4th Ed, John Wiley & Sons, Inc., which are
incorporated herein by reference), chemical methods, pharmaceutical formulations and
delivery and treatment of patients.






A: Nucleic Acids and Proteins 

As used herein, "nucleic acid" refers to any natural nucleic acid, including RNA and DNA 
as well as synthetic nucleic acid comprising modified or synthetic bases, and mixtures of 
modified or synthetic bases with natural bases. Such modified and/or synthetic bases may 
be referred to as derivatives of DNA or RNA. Preferably, "nucleic acid" refers to DNA. 



The invention includes the use of modified and/or artificial nucleic acids. A number of
modifications have been described that alter the chemistry of the phosphodiester
backbone, sugars or heterocyclic base components of nucleic acids.

Among useful changes in the backbone chemistry are phosphorothioates;
phosphorodithioates, where both of the non-bridging oxygens are substituted with
sulphur; phosphoroamidites; alkyl phosphotriesters and boranophosphates. Achiral
phosphate derivatives include 3'-O-5'-S-phosphorothioate, 3'-S-5'-O-phosphorothioate,
3'-CH2-5'-O-phosphonate and 3'-NH-5'-O-phosphoroamidate. Peptide nucleic acids
replace the entire phosphodiester backbone with a peptide linkage.

Sugar modifications are also known. The α-anomer of deoxyribose may be used, where
the base is inverted with respect to the natural β-anomer. The 2'-OH of the ribose sugar
may be altered to form 2'-O-methyl or 2'-O-allyl sugars, which provides resistance to
degradation without compromising affinity.

Modification of the heterocyclic bases must maintain proper base pairing. Some useful
substitutions include deoxyuridine for deoxythymidine; 5-methyl-2'-deoxycytidine and
5-bromo-2'-deoxycytidine for deoxycytidine. 5-propynyl-2'-deoxyuridine and
5-propynyl-2'-deoxycytidine have been shown to maintain biological activity when
substituted for deoxythymidine and deoxycytidine, respectively.

As used herein, the term "protein" includes single-chain polypeptide molecules as well as
multiple-polypeptide complexes where individual constituent polypeptides are linked by
covalent or non-covalent means. As used herein, the terms "polypeptide" and "peptide"
refer to a polymer in which the monomers are amino acids and are joined together through
peptide or disulphide bonds. The term "domain" also refers to polypeptides and peptides
having biological function. A peptide useful in the invention will have a binding or
transcription activating capability, i.e., with respect to binding to nucleic acids, other
proteins or polypeptides, and activation of σ54 RNAP transcription. It also may have
another biological function that is a biological function of a protein or domain from which
the peptide sequence is derived.
A hybrid protein is a protein or polypeptide which comprises constituent parts derived
from at least two naturally-occurring or artificial proteins. In particular, it may comprise
the DNA-binding domain of one protein and the protein-binding or transcription activating
domain of a second protein.

B: σ54 Activators

Activators of σ54 transcription are well known and have been reviewed, for example, by
Buck et al., J Bacteriol. 2000 Aug;182(15):4129-36; Studholme and Buck, FEMS
Microbiol Lett. 2000 May 1;186(1):1-9; Shingler, Mol Microbiol. 1996 Feb;19(3):409-16;
Goosen and van der Putte, Mol Microbiol. 1995 Apr;16(1):1-7; Merrick, Mol Microbiol.
1993 Dec;10(5):903-9; and others. A family of such activator proteins has been defined,
and its members found to share homology in the central (catalytic) domain which is
responsible for σ54 RNAP activation.

Members of the family include the following (the numbers are GenBank accession
numbers):

dbj|BAA16379.1| (D90877) FORMATE HYDROGENLYASE TRANSCRIPTIONAL ACTIVATOR.
emb|CAA26472.1| (X02616) pot. NifA gene product (aa 1-484) [Klebsiella pneumoniae]
emb|CAA53584.1| (X75972) anfA [Rhodobacter capsulatus]
emb|CAA92413.1| (Z68203) NifA homologue [Rhizobium sp.]
emb|CAA93242.1| (Z69251) MopR [Acinetobacter calcoaceticus]
emb|CAB53157.1| (X07567) NifAI [Rhodobacter capsulatus]
emb|CAB56537.1| (AJ249642) response regulator [Pseudomonas stutzeri]
gb|AAA58220.1| (U18997) ORF_o532 [Escherichia coli]
gb|AAA99303.1| (L43064) regulatory protein [Pseudomonas aeruginosa]
gb|AAB91397.1| (AF033203) NifAII protein [Rhodobacter capsulatus]
gb|AAC05586.1| (AF006075) regulatory protein [Bacillus subtilis]
gb|AAC37124.1| (LSI 176) FleQ [Pseudomonas aeruginosa]
gb|AAC45640.1| (AF010583) putative sigma 54 activator [Caulobacter crescentus]
gb|AAC46367.1| (AF014113) two-component response regulator [Vibrio cholerae]
gb|AAD34591.1|AF145956_1 (AF145956) transcriptional activator NifA [Rhodospirillum rubrum]
gb|AAD38416.1| (AF155934) NifA [Alcaligenes faecalis]
gb|AAF28395.1| (AF069392) FlaM [Vibrio parahaemolyticus]
gb|AAF33506.1| (AF170176) Salmonella typhimurium transcriptional regulatory protein
gb|AAF61932.1| (AF230804) sigma-54 activator protein Act1 [Myxococcus xanthus]
gb|AAF85342.1|AE004061_7 (AE004061) two-component system, regulatory protein [Xylella fastidiosa]
gb|AAF94676.1| (AE004230) sigma-54 dependent response regulator [Vibrio cholerae]
gb|AAF95280.1| (AE004286) sigma-54 dependent response regulator [Vibrio cholerae]
gb|AAF96095.1| (AE004358) sigma-54 dependent transcriptional regulator [Vibrio cholerae]
gb|AAG01527.1|AF288483_1 (AF288483) NifA [Azospirillum brasilense]
pir||A48291 ornithine decarboxylase inhibitor - Escherichia coli
pir||B49940 nitrogen regulator I homolog - Escherichia coli
pir||C70320 transcription regulator NifA family - Aquifex aeolicus
pir||C70396 transcription regulator NtrC family - Aquifex aeolicus
pir||C70454 transcription regulator NtrC family - Aquifex aeolicus
pir|p70315 transcription regulator NtrC family - Aquifex aeolicus
pir||H69581 transcription activator of acetoin dehydrogenase operon acoR - Bacillus subtilis
pir||I39719 nitrogen regulatory protein - Agrobacterium tumefaciens
pir||JC5471 regulatory protein NifA - Azospirillum lipoferum
pir||T08624 probable NtrC-type response regulator - Eubacterium acidaminophilum
sp|P03027|NIFA_KLEPN NIF-SPECIFIC REGULATORY PROTEIN
sp|P09570|NIFA_AZOVI NIF-SPECIFIC REGULATORY PROTEIN
sp|P12627|VNFA_AZOVI NITROGEN FIXATION PROTEIN VNFA
sp|P14375|HYDG_ECOLI TRANSCRIPTIONAL REGULATORY PROTEIN HYDG
sp|P21712|YFHA_ECOLI HYPOTHETICAL 49.1 KD PROTEIN IN GLNB-PURL INTERGENIC REGION (ORFXB)
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sp!P24426|NlFA_RHILTNIF-SPEClFIC REGULATORY PROTEIN 
spiP258521HYDG_SALTY TRANSCRIPTIONAL REGULATORY PROTEIN HYDG 
sp|P277131NIFA_HERSE NIF-SPECIFIC REGULATORY PROTEIN 
sp|P30667|NIFA_AZOBRNIF-SPECIFIC REGULATORY PROTEIN 

5 sp|P38035|RTCR_ECOLI TRANSCRIPTIONAL REGULATORY PROTEIN RTCR 
sp|P54929|NIFA_AZOLI KIF-SPECIFIC REGULATORY PROTEIN 
splP56266!NIFA_KLEOX NIF-SPECIFIC REGULATORY PROTEIN 
sp|Q06065|ATOC_ECOLI ACETOACETATE METABOLISM REGULATORY 
PROTEIN ATOC (ORNITHINE/ARGININE 

10 sp|Q46802|YGEV_ECOU HYPOTHETICAL SIGMA-54-DEPENDENT 
TRANSCRIPTIONAL REGULATOR IN 

sp|Q53206|NIFA_RHISN NIF-SPECIFIC REGULATORY PROTEIN 
sp|Q9ZIB7|TYRR_ERWHE TRANSCRIPTIONAL REGULATORY PROTEIN TYRR 

Moreover, a number of polypeptides belonging to the σ54 activator family have been
described whose 3D structures are known. These include: 113161 acetoin catabolism
regulatory protein; 113629 alginate biosynthesis transcriptional regulatory protein ALGB;
266789 type 4 fimbriae expression regulatory protein PILR; 113833 nitrogen fixation
protein ANFA; 138884 nitrogen fixation protein VNFA; 128219 nif-specific regulatory
protein; 3024194 nif-specific regulatory protein; acetoacetate metabolism regulatory
protein ATOC 1168553 (ornithine/arginine decarboxylase inhibitor) (ornithine
decarboxylase antizyme); 417166 transcriptional regulatory protein HYDG; 266622
nif-specific regulatory protein; 1352500 nif-specific regulatory protein; 128224 nif-specific
regulatory protein; 128225 nif-specific regulatory protein; 128221 nif-specific regulatory
protein; 128226 nif-specific regulatory protein; 1346014 transcriptional regulatory protein
FLBD; 549560 hypothetical sigma-54-dependent transcriptional regulator in GUTQ-HYPF
intergenic region; 139857 transcriptional regulatory protein XYLR (67 kd protein); 120053
formate hydrogenlyase transcriptional activator; 2507375 hypothetical 49.1 kd protein in
GLNB-PURL intergenic region (ORFXB) (orf-2); 134961 signal-transduction and
transcriptional-control protein; 1171795 nitrogen assimilation regulatory protein; 417388
nitrogen regulation protein nr(i); 123466 hydrogenase transcriptional regulatory protein
HOXA; 399925 hydrogenase transcriptional regulatory protein HOXA; 585586 nitrogen
assimilation regulatory protein NTRX; 118399 c4-dicarboxylate transport transcriptional
regulatory protein DCTD; 585267 pathogenicity locus probable regulatory protein HRPR;
1346313 pathogenicity locus probable regulatory protein HRPS; 549447 pathogenicity
locus probable regulatory protein WTSA; 585909 arginine utilization regulatory protein
ROCR; 136600 transcriptional regulatory protein TYRR; 1174836 transcriptional
regulatory protein TYRR homolog; 123748 hydrogenase transcriptional regulatory protein
HUPR1; 128604 nitrogen regulation protein NTRC; 1169293 glycerol metabolism operon
regulatory protein; and 129957 phosphoglycerate transport system transcriptional
regulatory protein PGTA. The numbers are GenBank gi numbers.






Preferably, the hybrid σ54 activator is based on the NifA activator. The Nif family of
bacterial enhancers regulates expression of nitrogenase components from σ54 promoters in
nitrogen-fixing bacteria, and is inhibited by NifL (Austin S. et al (1994) J. Bacteriol. 176,
3460). In bacteria lacking NifL, NifA is constitutively active. NifA is modular in
architecture and it is shown herein that this allows for the swapping of the natural
DNA-binding domain (DBD) for heterologous DBDs. Such NifA-DBD chimaeras are inactive
on the wild type promoter, but activate transcription from hybrid promoters bearing their
cognate target sequences.

Advantageously, the hybrid σ54 activator may be based on E. coli PspF (see Jovanovic et
al, (1996) J. Bacteriol. 178:1936-1945). PspF lacks the N-terminal regulatory domain
typical of σ54 activators, and is constitutively active but negatively regulated by PspA.
Thus, in bacteria lacking PspA, PspF is constitutively active.

Other σ54 activators may be rendered constitutively active by removal of the N-terminal
regulatory domain or by appropriate mutation.

Nucleic acid binding sequences or domains are known in the art and may be derived from
σ54 activator proteins or any other DNA binding proteins, whether naturally-occurring or
synthetic. Moreover, DNA-binding domains may be synthesised by partial or complete
randomisation. Many naturally-occurring DNA-binding proteins contain independently
folded domains for the recognition of DNA, and these domains in turn belong to a large
number of structural families, such as the leucine zipper, the homeodomain, the
"helix-turn-helix", the zinc finger and various other transcription factor families.

C: Libraries 

The term library refers to a mixture of heterogeneous polypeptides or nucleic acids. The
library is composed of members, each of which has a unique polypeptide or nucleic acid
sequence. To this extent, library is synonymous with repertoire, although in general the
term "library" is used herein to denote the source of the repertoire - e.g. a library of
nucleic acid molecules which encodes a repertoire of polypeptides. Sequence differences
between library members are responsible for the diversity present in the library. The
library may take the form of a simple mixture of polypeptides or nucleic acids, or may be
in the form of organisms or cells, for example bacteria, viruses, animal or plant cells and the
like, transformed with a library of nucleic acids. Advantageously, the nucleic acids are
incorporated into expression vectors, in order to allow expression of the polypeptides
encoded by the nucleic acids. In a preferred aspect, therefore, a library may take the form
of a population of host organisms, each organism containing one or more copies of an
expression vector containing a single member of the library in nucleic acid form which
can be expressed to produce its corresponding polypeptide member. Thus, the population
of host organisms has the potential to encode a large repertoire of genetically diverse
polypeptide variants.

Libraries of hybrid proteins may be prepared and selected together with libraries of hybrid 
nucleic acids. "Crossing" of hybrid libraries is performed by combinatorial infection, 
which has been employed successfully to generate very large antibody libraries (Griffiths 
et al (1994) EMBO J. 13, 3245). 

Although libraries for use in the present invention may be phage libraries, as is known in
the art, it is possible to use alternative libraries which are constructed using other vectors,
such as plasmids. In any case, the present invention does not require the library to be
capable of "display" of the gene product at the bacterial surface, as with phage libraries;
rather, the gene product is preferably expressed intracellularly, and is advantageously not
expressed as a fusion with a vector gene product.

DNA binding domain libraries are preferably based on known DNA binding domain
architectures (e.g. basic leucine zipper, bZIP) and may be derived using PCR
amplification with "family-specific" primers. Such libraries may be crossed with hybrid
promoters bearing defined target sequences or libraries of target sequences. In addition to
providing information on the distribution of members of the family in a given genome,
such libraries may be used to identify and study proteins or molecular compounds that
modify DNA interaction within a family of DNA binding domains, for example Tax
(from HTLV-1) in the case of bZIP proteins.

In an alternative embodiment, they may also be used to select DNA binding domains
which conditionally bind their target sequence only in the presence of other factors such
as protein cofactors or small molecular compounds, for example drugs that intercalate
into DNA or alter the degree of supercoiling or recognise DNA sequences which have
been modified chemically (e.g. methylated). The system can also be used "in reverse", i.e.
to select proteins or molecular compounds that disrupt a particular DNA-protein
interaction or to select DNA binding domains that do not bind a particular target sequence
or library thereof.

More advanced libraries are preferably derived directly from genomic DNA or cDNA
libraries and selected on hybrid promoters bearing a repertoire of target sequences,
comprising either a stretch of randomised sequence or a library of inserts derived from
fragmented genomic DNA. Data obtained in this way allow the compilation of a genomic
directory of DNA binding domains and the building of a promoter-DNA binding domain
interaction map.

D: Hybrid polypeptides 


The generation of hybrid polypeptides by domain fusion is well known in the art and may
be effected by fusing polypeptides or, preferably, by fusing nucleic acids which encode
the polypeptides. It has been known since 1976 that DNA binding and transcriptional
activator domains are separable, and can be swapped between proteins; see Ma and
Ptashne, who reported (Cell (1987) 51, 113-119; Cell (1988) 55, 443-446) that when both
the GAL4 N-terminal domain and C-terminal domain are fused together in the same
protein, transcriptional activity is induced. Other proteins are also known to function as
transcriptional activators via the same mechanism. For example, the GCN4 protein of
Saccharomyces cerevisiae as reported by Hope and Struhl, Cell, 46, 885-894 (1986), the
ADR1 protein of Saccharomyces cerevisiae as reported by Thukral et al., Molecular and
Cellular Biology, 9, 2360-2369, (1989) and the human estrogen receptor, as discussed by
Kumar et al., Cell, 51, 941-951 (1987) all contain separable domains for DNA binding
and for maximal transcriptional activation.

The same is specifically known of the σ54 bacterial transcriptional activators, although a
genetic screen based thereon has not been proposed. Therefore, the present invention may
be carried out using techniques which are known to those skilled in the art, particularly as
applied to 2-hybrid techniques in eukaryotic cells.

Synthesis of chimeric genes for the purposes of the present invention may be carried out
by any desired means, including polynucleotide synthesis and mutagenesis approaches.
For example, a number of methods for site-directed mutagenesis are known in the art,
from methods employing single-stranded phage such as M13 to PCR-based techniques
(see "PCR Protocols: A guide to methods and applications", M.A. Innis, D.H. Gelfand,
J.J. Sninsky, T.J. White (eds.), Academic Press, New York, 1990). Preferably, the
commercially available Altered Sites II Mutagenesis System (Promega) may be employed,
according to the directions given by the manufacturer.

E: Host Cells 

Host cells useful in conjunction with the present invention are prokaryotic cells,
advantageously bacterial cells. E. coli is the preferred host; however, host cells may
belong to any species or genus in which σ54 RNAP-driven transcription is possible, such
as Klebsiella, Rhodobacter, Rhizobium, Acinetobacter, Pseudomonas, Escherichia,



WO 01/18244

PCT/GB00/03450

Bacillus, Caulobacter, Vibrio, Rhodospirillum, Alcaligenes, Salmonella, Myxococcus,
Xylella, Azospirillum, Aquifex, Agrobacterium and other organisms. In E. coli, the
preferred configuration is a modified strain, in which a truncated form of NifA (or another
activator) is coexpressed to boost specific activation (see Methods).

Preferably, the host cells lack repressors of the σ54 activator being used, such that the
transcription activating domain is constitutively active. Repressors may be deleted by
genetic mutation and/or selection, or inhibited by expression of antisense constructs, or
the like. In general, due to the accessibility of bacterial genetics, especially in E. coli,
deletion of repressor genes is straightforward to those skilled in the art.

F: Reporter Genes 

Reporter genes of various types are known in the art and may be used in conjunction with
the present invention. A "reporter gene", as referred to herein, may be the coding
sequence which encodes a detectable gene product, or the coding sequence including the
necessary control sequences for its expression in accordance with the invention, as
appropriate.

Advantageously, the reporter gene is selected from the group consisting of metabolic
markers such as the lac operon (lacZ, lacY and lacA); proteins conferring a fluorescent
phenotype, such as GFP; proteins conferring antibiotic resistance, such as Zeo; and
proteins conferring another selectable property.

Certain reporters, such as the LacZ gene, are widely used in bacterial genetics and are
useful in the performance of the invention. However, other genes may also be employed,
including fluorescent proteins. For example, green fluorescent proteins (GFPs) of
cnidarians, which act as their energy-transfer acceptors in bioluminescence, can be used in
the invention. A green fluorescent protein, as used herein, is a protein that fluoresces
green light, and a blue fluorescent protein is a protein that fluoresces blue light. GFPs
have been isolated from the Pacific Northwest jellyfish, Aequorea victoria, from the sea
pansy, Renilla reniformis, and from Phialidium gregarium (Ward et al., 1982,
Photochem. Photobiol., 35: 803-808; Levine et al., 1982, Comp. Biochem. Physiol., 72B:
77-85).

A variety of Aequorea-related GFPs having useful excitation and emission spectra have
been engineered by modifying the amino acid sequence of a naturally-occurring GFP from
Aequorea victoria (Prasher et al., 1992, Gene, 111: 229-233; Heim et al., 1994, Proc.
Natl. Acad. Sci. U.S.A., 91: 12501-12504; PCT/US95/14692). As used herein, a
fluorescent protein is an Aequorea-related fluorescent protein if any contiguous sequence
of 150 amino acids of the fluorescent protein has at least 85% sequence identity with an
amino acid sequence, either contiguous or non-contiguous, from the wild-type Aequorea
green fluorescent protein (SwissProt Accession No. P42212). Similarly, the fluorescent
protein may be related to Renilla or Phialidium wild-type fluorescent proteins using the
same standards.

Aequorea-related fluorescent proteins include, for example, wild-type (native) Aequorea
victoria GFP, whose nucleotide and deduced amino acid sequences are presented in
GenBank Accession Nos. L29345, M62654, M62653, and other Aequorea-related
engineered versions of Green Fluorescent Protein, of which some are listed above. Several
of these, i.e., P4, P4.3, W7 and W2, fluoresce at a distinctly shorter wavelength than wild
type.

A specific advantage of fluorescent proteins is that they facilitate FACS sorting of cells in
a manner dependent on reporter gene expression (Norman, S.O. (1980). Flow cytometry.
Med. Phys. 7, 609-615; Mackenzie, N.M. & Pinder, A.C. (1986). The application of flow
microfluorimetry to biomedical research and diagnosis: a review. Dev. Biol. Stand. 64,
181-193).

Other reporter genes may complement auxotrophic mutations, confer antibiotic resistance
or other selectable characteristics to the host bacteria. Reporter genes may be wholly or
partly heterologous to the host cell, and introduced by mutagenesis and/or transformation
with appropriate vectors. Alternatively, endogenous σ54-responsive genes may be used
as reporter genes.

The reporter gene also contains a binding site for σ54 RNAP. The consensus sequence
for σ54 RNAP binding is 5' TGGCAC-N5-TTGCa/t 3'. This sequence is located at -12 to
-24 with respect to the start of transcription, whilst the more common sigma 70
recognition sequence is situated at -10 to -35. Both the GG & GC must be on the same
face of the DNA helix.
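
As an illustration only, the consensus given above can be expressed as a pattern and used to scan a candidate reporter sequence. The following is a minimal sketch: the function name and the test sequence are hypothetical, only exact matches on the given strand are reported, and natural promoters may deviate from the consensus.

```python
import re

# Consensus quoted in the text: 5' TGGCAC-N5-TTGC(A/T) 3'
SIGMA54_CONSENSUS = re.compile(r"TGGCAC[ACGT]{5}TTGC[AT]")


def find_sigma54_sites(sequence):
    """Return (0-based start, matched text) for each exact consensus match.

    Only the strand supplied is scanned; spaces and case are ignored.
    """
    seq = "".join(sequence.upper().split())
    return [(m.start(), m.group(0)) for m in SIGMA54_CONSENSUS.finditer(seq)]


# Hypothetical sequence built for illustration, not taken from a real promoter
example = "AAACCCTGGCACGATTTTTGCACCCGGG"
sites = find_sigma54_sites(example)
```

A sequence lacking the motif simply returns an empty list, so the same call can be used to confirm that a deletion has removed the site.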

In order to increase specificity, combinations of two or more reporter genes (preferably in 
tandem) may be used. 


Where the reporter gene is chimeric, i.e. comprises heterologous binding sites for the
nucleic acid binding sequence and σ54 RNAP binding sites incorporated into the same
nucleic acid, the spacing between the σ54 RNAP binding site and the nucleic acid binding
sequence binding site is preferably conserved with respect to the natural gene from which
the σ54 RNAP binding site is taken. Advantageously, the spacing is at least calculated
such that the spatial relationship of the elements on respective faces of the nucleic acid
helix is maintained.
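
By way of illustration, the face-of-the-helix relationship can be estimated from the spacing in base pairs. The sketch below assumes an average helical repeat of 10.5 bp per turn for B-form DNA; that value, the function names and the example spacings are assumptions made for the example and are not taken from the present description.

```python
BP_PER_TURN = 10.5  # assumed average helical repeat of B-form DNA


def helical_phase_degrees(spacing_bp, bp_per_turn=BP_PER_TURN):
    """Angular offset (0 to <360 degrees) between two sites spacing_bp apart."""
    return (spacing_bp % bp_per_turn) / bp_per_turn * 360.0


def phase_shift(natural_spacing_bp, chimeric_spacing_bp, bp_per_turn=BP_PER_TURN):
    """Smallest angular difference (0 to 180 degrees) between two spacings."""
    diff = abs(
        helical_phase_degrees(natural_spacing_bp, bp_per_turn)
        - helical_phase_degrees(chimeric_spacing_bp, bp_per_turn)
    )
    return min(diff, 360.0 - diff)


# Hypothetical spacings: adding two full turns (21 bp) keeps the sites in phase,
# whereas adding 5 bp places them on nearly opposite faces.
in_phase = phase_shift(100, 121)
out_of_phase = phase_shift(100, 105)
```

A shift close to 0 degrees indicates that the chimeric arrangement keeps the two sites on the same relative faces as the natural gene; a shift approaching 180 degrees indicates the opposite.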

Reporter genes advantageously comprise a binding site for a further activation factor, such
as IHF. These factors are believed to induce bending of the DNA, thus potentiating
activation of σ54 RNAP-driven transcription by σ54 activators. Alternatively, the DNA
itself may be intrinsically bent, thus providing constitutive potentiation of σ54-specific
activation.

G: Configurations of the Invention

The present invention may be configured in three basic ways: a first configuration, in
which reporter gene activation is dependent on the interaction between the nucleic acid
and a nucleic acid binding domain on the hybrid protein; a second configuration, in which
reporter gene activation is dependent on interaction between bait and prey polypeptides
which serves to bring together two or more components of the hybrid protein; and a third
configuration, in which reporter gene activation by a σ54 activator is dependent on the
presence of DNA-bending polypeptides. As referred to herein, an interaction is
advantageously a binding interaction.

Where the invention is configured to detect protein-nucleic acid interaction, libraries of 
proteins and/or nucleic acids may be prepared as described above. Proteins having 
improved nucleic acid binding, or nucleic acid sequences having improved affinity for 
protein domains, may be developed by mutagenesis and selection of candidate sequences. 
Alternatively, protein and/or nucleic acid sequences may be used to identify, in vivo, 
cognate binding partners. 

Chimeric σ54 activators also offer the opportunity to better understand aspects of the
process of transcriptional activation at σ54 promoters. In the case of NifA, it is known that
binding of the target sequence together with ATP binding promotes oligomerisation of
NifA. It is believed that it is the oligomer which contacts the polymerase and catalyses the
ATP-driven isomerisation of the polymerase holoenzyme. Taking advantage of the
superactivation effect described above it may be possible to address questions such as
which components of the oligomer (e.g. the DNA-bound NifA vs. NifAΔC, i.e. NifA with
the DNA binding domain removed) are contacting the polymerase and/or coupling
ATP hydrolysis to transcriptional activation. Furthermore, usage of NifAΔC cofactors
from different species (together with their diversification by PCR shuffling) allows
identification of the sequence regions critical for transcriptional activation and a
"maturation" of the NifAΔC coactivator. Indeed, we have found the NifA from K.
pneumoniae to be a superior cofactor to A. vinelandii NifA. Finally, it may be possible to
use chimeras of a known DNA binding domain (e.g. GCN4) and a cDNA library as a
prokaryotic "enhancer" trap, to isolate σ54 activators on a genome-wide scale.

Configuration of the invention to detect protein-protein interactions follows the general
scheme of the yeast two-hybrid assay, and the reagents used in the invention may be set
up accordingly. In general, therefore, the invention will comprise a nucleic acid binding
domain-bait fusion, and a prey-σ54 activator domain fusion. Although, in general, "bait"
refers to a known polypeptide and "prey" to an unknown polypeptide, the terms may be
used interchangeably. Indeed, the invention comprises configurations in which both bait
and prey are known, or both are unknown.

Binding between the bait and the prey results in constitution of a hybrid protein which
comprises both a nucleic acid binding domain and a σ54 activator domain. The hybrid
protein is able to activate transcription from a reporter gene, thus providing a bait:prey
binding-dependent signal.

Protein-protein interactions may be selected using the preferred NifA system, in which the
hybrid σ54 transcriptional activator includes the NifA activation domain. The NifA
bacterial two-hybrid system may be used for the generation of interaction matrices between
cDNA libraries. Ultimately such interaction matrices may yield an interaction map of the
proteins of an organism. The invention provides an alternative to the yeast two-hybrid
system.


Systems based on σ54 have a number of advantages over the other systems that are
available, e.g. the conceptually similar yeast one and two-hybrid system. A bacterial host
allows substantially larger repertoires to be obtained and thus a much larger molecular
diversity to be screened. In particular, using combinatorial infection, the system of the
invention allows the "crossing" of σ54-chimera repertoires with libraries of hybrid
reporter constructs, thus permitting coevolution of DNA binding domains and recognition
sites, or coselection of DNA binding domains and target sites from genomic libraries.

Because selection in the σ54-based system is based on a positive readout, i.e. activation of
transcription, it is less prone to false positives than other approaches relying on the
inhibitory effect of the expressed DNA binding domains, like the transcription interference
assay (Elledge S.J. et al (1989) Proc. Nat. Acad. Sci. 3689). In vivo selection in
general may result in the selection of novel DNA binding domains that are more attuned to
working under realistic conditions, including supercoiling of the recognition site, presence
of a large excess of chromosomal DNA and high protein concentration. Another
advantage of the system of the invention is that extremely low levels of the hybrid protein
appear to be sufficient to effect maximum activation of transcription. This is particularly
helpful in the case of DNA binding domains that are prone to segregation.

The σ54-based systems of the present invention may be further adapted to take into
account potential disadvantages of bacterial expression. For instance, E. coli expression
may be suboptimal for large eukaryotic transcription factors. However, large eukaryotic
proteins can often be split into smaller domains which retain function and are usually
readily expressed in E. coli.

According to the third configuration, a constitutively active σ54 activator may be used to
screen a library of candidate DNA-bending polypeptides, preferably in an IHF-negative host.
Since the degree of activation by the σ54 activator may be dependent on DNA bending by
additional factors, the levels of expression of the reporter gene will be modulated by the
DNA-bending activity of the candidate DNA-bending polypeptides.






The invention is further described, for the purposes of illustration only, in the following
examples.

Examples 



NifA from A. vinelandii is a well-studied member of the family of bacterial enhancers and
it is a positive regulator of the expression of nitrogenase components in diazotrophs. It is
inhibited by NifL in response to the presence of oxygen or ammonia. When expressed in E.
coli, which lacks endogenous NifL or an equivalent, NifA is constitutively active. Because
of the highly conserved nature of the activation mechanism of σ54 RNA polymerase, NifA
is a very strong activator of transcription in E. coli.

Like other members of the family of bacterial enhancer proteins, NifA is modular in
architecture, both structurally and functionally, comprising 3 domains: an N-terminal sensor
domain, a central activation domain (AD), and a C-terminal DNA binding domain (DBD).
The central activation domain (AD) can activate transcription independent of DNA
binding if overexpressed. Thus the DBD's function appears to be primarily to increase the
activation domain's concentration in the proximity of the promoter.

We have exploited the modularity of the enhancer structure and swapped the natural NifA
DNA binding domain (DBD) for heterologous DBDs and libraries thereof. Here we
describe the activity of these NifA-chimeras in the activation of transcription from the
σ54-dependent promoter nifH and hybrids thereof.

Materials & Methods 


Media & Reagents 

2xTY and MacConkey agar are described elsewhere (Miller J.H. (1972) Experiments in
molecular genetics. Cold Spring Harbour, NY). Antibiotics were used at the following
concentrations: Ampicillin 0.1 mg/ml, Chloramphenicol 10 μg/ml, Streptomycin 25 μg/ml.
Min-lac medium was essentially M9 medium (Sambrook et al., Molecular Cloning: A
Laboratory Manual, 2d ed. (1989) Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, N.Y.) supplemented with 1 mM MgSO4, 20 μM CaCl2, 2% (w/v) lactose, 2 mg/ml
casamino acids, 40 μg/ml L-tryptophan, 5 μg/ml thiamine and appropriate antibiotics.
Min-lacX plates were essentially M9 plates supplemented with 2% lactose, appropriate
antibiotics and 40 μg/ml X-gal (5-bromo-4-chloro-3-indolyl-β-D-galactopyranoside).



Strains 

TG1AK was derived from TG1 (Gibson T. J. (1984) Studies on the Epstein-Barr virus
genome. University of Cambridge) using the genome integration strategy of Haldimann A.
et al. (1996) Proc. Nat. Acad. Sci. USA 93, 14361. Briefly, NifA (K. pneumoniae)
residues 1-462 was amplified using Pfu polymerase (Stratagene) and primers 1 (5'- GAG
TCA CTA ACG CAT ATG ATC CAT AAA TCC GAT TCG GAC -3') and 2 (5'- CGC GGA
TCC AAG CGG CCG CTC ATT AGC GAT GGT TGA ACA GAA TCA C -3'), cut with
NdeI and BamHI, cloned into the genome targeting suicide vector pSK50D-uidA2
(Haldimann, Op. Cit.) and transformed into the Pir+ host strain BW23473 (Metcalf W.W.
et al (1994) Plasmid 35, 1). Vectors were isolated and transformed into the Pir- strain TG1
harbouring the plasmid pINT-ts (Hasan N. et al (1994) Gene 150, 51). Chromosomal
integration was induced by a temperature shift to ^l^C, which leads to expression of λ
integrase from pINT-ts and simultaneously stops its replication. Integrants were identified
by Kanamycin resistance and screened for Nif coactivation. Once obtained, TG1AK was
grown routinely without antibiotic selection.
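
As a simple illustration of how the cloning sites in the primers relate to the enzymes used, the sketch below checks primers 1 and 2 for the NdeI (CATATG) and BamHI (GGATCC) recognition sequences. The function names are hypothetical; because both recognition sequences are palindromic, checking the strand as written is sufficient for this purpose.

```python
# Recognition sequences of the two enzymes used to cut the amplified fragment
RESTRICTION_SITES = {
    "NdeI": "CATATG",
    "BamHI": "GGATCC",
}

# Primers 1 and 2 as printed in the text (triplet spacing retained)
PRIMER_1 = "GAG TCA CTA ACG CAT ATG ATC CAT AAA TCC GAT TCG GAC"
PRIMER_2 = "CGC GGA TCC AAG CGG CCG CTC ATT AGC GAT GGT TGA ACA GAA TCA C"


def normalise(primer):
    """Remove whitespace and convert to upper case."""
    return "".join(primer.upper().split())


def sites_in_primer(primer, sites=RESTRICTION_SITES):
    """Return the sorted names of the enzymes whose site occurs in the primer."""
    seq = normalise(primer)
    return sorted(name for name, site in sites.items() if site in seq)


primer_1_sites = sites_in_primer(PRIMER_1)
primer_2_sites = sites_in_primer(PRIMER_2)
```

The same check can be applied to the other primers by adding the relevant recognition sequences to the dictionary.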


Constructs 

Chimeric constructs were based on pDB737 (Austin S. et al (1994) J. Bacteriol. 176, 3460;
Buck M. et al (1986) Nature 320, 374) encoding NifA (A. vinelandii) under the control of
the T7 promoter in the plasmid pT7-7 (Tabor S. & Richardson C.C. (1985) Proc. Natl.
Acad. Sci. USA 82, 1074). Expression was by leakiness of the T7 promoter. Chimeras were
constructed taking advantage of a unique BanII cutting site in the linker region between
the central domain of NifA and the DBD. GCN4 was amplified using Pfu polymerase
(Stratagene) and primers 3 (5'- GCT GCC AGC GAG AGC CCG CCG CTC GCC GCG
ATT GTG CCC GAA TCC AGT GAT CCT -3') and 4 (5'- GAG CTA AAG CTT TTA
TTA GCG TTC GCC AAC TAA TTT CTT TAA TCT GGC -3'), cut with BanII and
Hind3 and ligated into pDB737 cut with BanII and Hind3. ERDBD was amplified using
primers 5 (5'- GTC GAC AAC GAG AGC CCG CCG CTC GCC GCG GAA ACG CGT
TAG TGC GCT GTT -3') TGC and 6 (5'- GGT CAG CGC GTG GAT CCT TAA CCA
CCA CGA CGG TCT TTA CG -3'), cut with BanII and BamHI and ligated into pDB737
cut with BanII and BamHI. The vector p737S1 is derived from pDB737 by replacing the
bla gene with aadA conferring streptomycin resistance and by the insertion of an f1 phage
origin for packaging of the vector into filamentous phage particles. Briefly, aadA was
amplified using primers 7 (5'- TCA GCG CAC GCT GAC GTC GTG GAA ACG GAT
GAA GGC ACG AAC -3') and 8 (5'- CCG CCT GGA GGT GGC CAT TAT TTG CCG ACT
ACC TTG GTG ATC TCG CC -3'), cut with AatII and MscI and ligated with
pDB737 cut with AatII and ScaI. The resulting vector p737S was cut with AatII and ClaI. The
f1 ori was amplified using primers 9 (5'- GCT GCC GAC TCG ATC GAT GAA TGG
CGA ATG GCG CCT GAT GCG G -3') and 10 (5'- CCG GGT CGT GAC GTC AGT GTT
GGC GGG TGT CGG GGC TGG C -3'), cut with AatII and ClaI and cloned into the cut
p737S to give p737S1. NifA-X chimeras were transferred from pDB737 to p737S1 by
digestion with NdeI and Hind3 (BamHI for NifA-ERDBD).

Reporter constructs were derived from pACYC184 and the vector pMB1 (Buck M. et al.
(1986) Nature 320, 374). Briefly, the lac operon (lacZYA) was amplified with primers 11
(5'- GAG TCA ATT CGG GGA TCC CGT CGT TTT ACA ACG TCG TGA CTG G -3') and
12 (5'- GAG TCA TTC TGG CCA GTC GAG CGC TCT GCC GGT GGT TAC -3') and
cut with BamHI and MscI. The nifH promoter segment from pMB1 was amplified with
primers 13 (5'- GAG TCA TTC AAG CTT GCG TGG AAT AAG ACA CAG GGG
GCG -3') and 14 (5'- GAG TCA TTC GGG ATC CCC GGA TTT ACC GAT ACC GCC
TTT ACC -3') and cut with Hind3 and BamHI, and the two fragments were simultaneously
ligated with pACYC184 cut with Hind3 and BsaAI to give pMB3. The f1 ori was amplified with
primers 15 (5'- GCT GCC GAC TCG GCT AGC GAA TGG CGA ATG GCG CCT GAT
GCG G -3') and 16 (5'- GCC GGG TCG CTT TAA AGT GTT GGC GGG TGT CGG GGC
TGG C -3'), cut with NheI and DraI and ligated into pMB3 cut with both NheI and XmnI
to give pMB31.


Selection and screening 

Cells were cotransformed either by simultaneous or sequential electroporation with an
expressor construct and a reporter construct and grown overnight with appropriate
antibiotic selection at 34°C in M9-lac medium and plated out. β-gal expression was scored
either on MacConkey or Minlac-X-gal indicator plates or by ONPG enzyme assay of
selected colonies (see below).

Enzyme assay 

ONPG assays used to measure β-gal activity were essentially as described by Kolmar H.
et al. (1995) EMBO J 14, 3895. Briefly, 20 µl of an overnight culture was transferred to a
microtitre well, 100 µl of chloroform-saturated Z-buffer (100 mM NaHPO4, 1 mM KCl,
1 mM MgSO4, 50 mM β-mercaptoethanol, pH 7.0 (Miller J.H. (1972) Experiments in
molecular genetics. Cold Spring Harbor, NY)) was added and the optical density at 600 nm
determined using an ELISA reader. Cells were lysed by addition of 50 µl Z-buffer with
0.4% (w/v) SDS and incubated at 30°C for 10 min. 50 µl of Z-buffer with 4 mg/ml
o-nitrophenyl-β-D-galactopyranoside was added and the optical density at 420 nm was
recorded automatically every 15 s over a period of 60 min. Specific β-gal activity was
calculated from the Vmax as in Miller (Op. Cit.).
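
The calculation of a specific activity from a kinetic read can be sketched as follows. This is an illustrative sketch under stated assumptions, not the exact procedure of Miller or Kolmar et al.: Vmax is taken as the steepest least-squares slope of OD420 over a sliding window of consecutive readings, and it is normalised Miller-style by the OD600 reading and the culture volume. The function names, the window length and the normalisation constant are all assumptions made for illustration.

```python
def max_rate(od420, interval_s=15.0, window=8):
    """Steepest least-squares slope (OD420 units per minute) over any
    `window` consecutive readings taken every `interval_s` seconds."""
    if len(od420) < window:
        raise ValueError("need at least `window` readings")
    times = [i * interval_s / 60.0 for i in range(window)]  # minutes
    t_mean = sum(times) / window
    denom = sum((t - t_mean) ** 2 for t in times)
    best = 0.0
    for start in range(len(od420) - window + 1):
        ys = od420[start:start + window]
        y_mean = sum(ys) / window
        slope = sum((t - t_mean) * (y - y_mean) for t, y in zip(times, ys)) / denom
        best = max(best, slope)
    return best


def specific_activity(od420, od600, culture_ml=0.02):
    """Miller-style units: 1000 * Vmax / (OD600 * culture volume in ml).
    The normalisation is an assumption, not taken from the text."""
    return 1000.0 * max_rate(od420) / (od600 * culture_ml)
```

Using the steepest window rather than the overall slope discounts the lag after substrate addition and the plateau once substrate is depleted, both of which would otherwise lower the estimate.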

Example 1: NifA chimeras with heterologous DNA binding domains activate
transcription but only from promoters with a cognate recognition site

To investigate in what way transcription activation by NifA was dependent on the NifA
DNA binding domain (DBD) and on native nif promoter structure, we prepared NifA-
chimeras in which the NifA DBD had been replaced by heterologous DBDs of diverse
structural architectures. Initially we explored DBDs which, like the NifA wild type (wt)
DBD, bind to symmetrical DNA recognition sequences, such as the basic leucine zipper
(bZIP) DBD of the yeast transcription factor GCN4 and the Zn-finger domain of the
human estrogen receptor DNA binding domain (ERDBD), and determined their
capacity to activate transcription of a lacZ reporter gene in vivo from a hybrid nifH
promoter, in which the NifA UAS had been deleted and replaced by recognition sites for
the heterologous DBDs.

In order to simplify comparison of transcription activation by NifA chimeras with
activation by wt NifA, all reporter constructs had a single DNA recognition site. The wt
nifH promoter UAS contains three bona fide NifA recognition sites. Deletion of the two
sites more distal to the promoter, however, did not appear to reduce transcription
activation in our reporter under the conditions tested.

Transcription activation by NifA-chimeras was specific in that they only activated lacZ 
expression from hybrid-promoters bearing their cognate recognition sequences but not 
from control reporter constructs bearing wild type UAS or a non-cognate site (Fig. 3). In 
analogy to wt NifA the presence of two or more recognition sites (in phase, see below) did 
not increase activation by the Nif-GCN4 chimera. 

Activity was also dependent on the phasing of the recognition site with respect to the
promoter: when the symmetric ATF/CREB recognition site for GCN4 was offset in
increments of 1 bp, optimal activity was observed when the ATF/CREB site was centred on
the same bp as the symmetric wt UAS. Presumably efficient contact with the RNA Pol
holoenzyme requires that the activator be bound on the right face of the DNA.
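
The face-of-the-helix argument can be illustrated with a short calculation. This is a sketch under an assumption that is not stated in the text: a helical repeat of about 10.5 bp per turn, the textbook value for B-form DNA. The function name is hypothetical.

```python
def face_deviation_deg(offset_bp: int, helical_repeat_bp: float = 10.5) -> float:
    """Angular distance (0 to 180 degrees) between the face presented by a
    site shifted by `offset_bp` and the face of the unshifted site."""
    angle = (offset_bp * 360.0 / helical_repeat_bp) % 360.0
    return min(angle, 360.0 - angle)


if __name__ == "__main__":
    # Rotation of the bound activator around the helix for 0 to 10 bp shifts.
    for offset in range(0, 11):
        print(offset, round(face_deviation_deg(offset), 1))
```

Under this assumption each 1 bp shift rotates the bound activator by roughly 34 degrees, so a shift of about 5 bp places it on nearly the opposite face of the DNA, while a shift of two full turns (21 bp) restores the original face.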

Transcription activation by NifA-chimeras appears to preserve the fine specificity of
isolated DBDs. Wild-type GCN4 binds with equal affinity to the symmetric ATF/CREB site
as well as to the pseudo-symmetric AP-1 site. Indeed, the NifA-GCN4 chimera showed
identical levels of transcription activation in reporter constructs with either of these sites
(Fig. 3). A NifA-ERDBD chimera showed strong activity on a reporter with its cognate
ERE site but no activity above background levels with reporters bearing the similar GRE
recognition site for the closely related glucocorticoid receptor DBD (Fig. 4).

Example 2: Coexpression of wild-type NifA with NifA-chimeras boosts specific 
transcription activation by NifA chimeras in a specific and DNA independent manner 

The level of transcription activation by the NifA-GCN4 and NifA-ERDBD chimeras was
lower (ca. 10%) than for wt NifA. However, near wt levels of activity (up to 80%) were
reached when wt NifA was coexpressed within the same cell as a "coactivator".

The coactivation was independent of DNA binding, as NifA variants in which the DBD
had been deleted (NifAΔC) were found to be just as active as wt NifA. On the other hand,
coexpression of an isolated NifA central domain (both the DBD and the N-terminal
sensor domain deleted (NifAΔNC)) failed to coactivate. NifA derived from different
species showed greatly variable efficiencies as coactivators. NifA variants from K.
pneumoniae (NifA Kp, NifAΔC Kp) were almost three times as effective as NifA, while
NifA variants from Rhizobium (NifA Rh1, NifA Rh2) were poorly active as coactivators
(Fig. 5).

The coactivator effect was found to enhance only specific transcriptional activation and not
background levels of transcription from promoters with non-cognate recognition sites. We
therefore constructed an E. coli strain expressing NifAΔC Kp (the K. pneumoniae NifA
with its DBD deleted) from a weak promoter (phoB) from the chromosome (TG1:AK).
The coactivation effect has analogies in eukaryotic transcription, for example the enhancer
Sp1, in which isolated Sp1 activation domains can stimulate transcriptional activation by
the DNA binding-form of Sp1, a phenomenon termed "superactivation".

Example 3: Tethering of NifA chimera at the UAS is sufficient for activation, but strong
activation requires correct positioning

We also investigated transcription activation by NifA-chimeras with asymmetrical
recognition sites such as the classic Zn-finger Zif268 as well as the DBD from p53.

Both NifA-Zif268 and NifA-p53 chimeras activated transcription, but only at low levels
(2- to 5-fold above the background). However, when the Zif recognition site was
duplicated to give a symmetric palindromic site, transcription activation increased
substantially. Non-palindromic duplication of the recognition site in tandem did not
increase activation.

Thus, while simple tethering is sufficient for some activation, only bipartite binding
appears to give a strong activation. Presumably, tethering only leads to an approximate
positioning of the activation domain with respect to the RNA polymerase holoenzyme,
thereby reducing the likelihood of a productive interaction.

Example 4: Selection of active NifA-chimeras by lac complementation 

Using expression of the lac operon (lacZYA) from our reporter construct as the read-out
of transcription activation allows the selection of active NifA-chimeras on the basis of
metabolic complementation of a Δlac strain, with lactose as the only carbon source.
Initially we spiked populations of NifA-ERDBD with NifA-GCN4 at the ratios 1/10^,
1/10^6 in the presence of the GCN4 cognate reporter ATF/CREB-nifH and grew
populations overnight in minimal medium supplied with lactose. Pre- and post-selection
populations were scored by plating on MacConkey-lactose plates as well as by PCR
screening. The results are summarised in Table 1. Selection factors of up to 10,000-fold
per round were observed.

Table 1: Selection factors for Nif selection by lac complementation

NifGCN4/NifERDBD Selection factor 

1/10^ 40 fold 

1/10^ 40 fold 

1/10^ 200 fold 

1/10^ 4000 fold 
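
A selection factor of this kind can be computed as an enrichment ratio. The sketch below assumes, since the text does not define the term explicitly, that the selection factor is the frequency of the active clone after selection divided by its frequency before selection. The function name and the numbers in the usage line are hypothetical and are not values from Table 1.

```python
def selection_factor(pre_frequency: float, post_frequency: float) -> float:
    """Enrichment ratio: frequency of the active clone after selection
    divided by its frequency before selection (an assumed definition)."""
    if not 0.0 < pre_frequency <= 1.0:
        raise ValueError("pre_frequency must be in (0, 1]")
    if not 0.0 <= post_frequency <= 1.0:
        raise ValueError("post_frequency must be in [0, 1]")
    return post_frequency / pre_frequency


if __name__ == "__main__":
    # Hypothetical example: 1 active clone per million before selection,
    # 4 per thousand afterwards.
    print(selection_factor(1e-6, 4e-3))
```

Because the post-selection frequency cannot exceed 1, the factor attainable in a single round is bounded by the reciprocal of the starting frequency, which is why higher dilutions allow larger observed factors.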



Example 5: Selection of active NifA-chimeras by flow cytometry. 

Expression of β-galactosidase (lacZ) as the read-out of transcription activation allows the
selection of active NifA-chimeras on the basis of metabolic complementation of a Δlac
strain, grown on lactose as the only carbon source. However, metabolic selection
predisposes the system to the generation of false positives. Presumably, the prolonged
growth under metabolic selection selects for mutant promoters that are active in the
absence of a cognate enhancer.

We have observed that this only occurs for library sizes exceeding 10*. Indeed, others
have found (using a related bacterial two-hybrid system) that it is not possible to retrieve
positive clones from dilutions higher than 1/10* by metabolic lac selection (G. Karimova,
et al., (1998) Proc Natl Acad Sci USA 95, 12532-7). As it is well known that bacteria can
develop a mutator phenotype under adaptive stress (P. D. Sniegowski, P. J. Gerrish, R. E.
Lenski, (1997) Nature 387, 703-5), we conclude that it is preferable to separate the
selection from the amplification (growth) step in order to reduce the likelihood of
revertants.
We thus replaced lacZ with the Aequorea victoria green fluorescent protein (EGFP: F64L,
S65T, ex 488 nm, em 527 nm; the Clontech FACS-optimised variant pGFPmut3.1 (S65G,
S72A, ex 501 nm, em 511 nm) was also tried, but found inferior) as the reporter gene.
GFP has the advantage that cells can be grown first and then separated on the basis of
fluorescence using fluorescence activated cell sorting (FACS).

We prepared a trial library of mutant GCN4 bZIP DBDs (library size 10*) in which 5 key
residues (Asn235, Ala238, Ala239, Ser242, Arg243) of GCN4 interacting with DNA were
randomised and selected it against a GFP hybrid reporter with the cognate ATF/CREB
site. Library populations were grown overnight at 34°C in non-fluorescent medium NFM
(minimal medium supplied with 2% glucose, 0.2% casamino acids, 12 ng/ml L-Trp). For
FACS (Cytomation MoFlo, 488 nm laser, FL-1 530/40 filter) a 1 ml aliquot was diluted
10X in NFM and the top 1% fluorescent cell population was sorted into a 96-well plate at
1 cell per well, and grown up overnight at 34°C. Cell fluorescence of the grown-up clones
was measured using a SPECTRAmax GEMINI Dual-Scanning Microplate
Spectrofluorometer (Molecular Devices), ex 480, em 520 (cut-off 515 nm). Plasmids from
fluorescent wells were sequenced afterwards. Pre- and post-selection populations were
also scored by PCR screening as well as by plating on min glu (M9 minimal medium +
glucose) plates and visualised using a fluorescence microscope.
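
For orientation, the sequence space opened by randomising five positions can be estimated as follows. This is a sketch under assumptions not stated in the text: each position is taken to be fully randomised over the 20 amino acids, variants are assumed to be sampled equally and independently, and coverage uses the standard Poisson approximation. The function names and the clone count in the usage line are hypothetical.

```python
import math


def theoretical_diversity(n_positions: int, alphabet_size: int = 20) -> int:
    """Number of distinct protein variants for fully randomised positions."""
    return alphabet_size ** n_positions


def expected_coverage(n_clones: float, n_variants: int) -> float:
    """Expected fraction of variants present at least once, assuming equal
    and independent sampling (Poisson approximation)."""
    return 1.0 - math.exp(-n_clones / n_variants)


if __name__ == "__main__":
    variants = theoretical_diversity(5)        # 20**5 protein variants
    print(variants)
    # Hypothetical library with as many clones as there are variants.
    print(round(expected_coverage(variants, variants), 3))
```

With these assumptions five randomised residues give 3.2 million possible protein sequences, and a library with the same number of independent clones would be expected to contain only about 63% of them.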

10' cells were sorted in total, of which 219 cells were in the top 1% fluorescent
population and 132 of these were captured in the 96-well plates. 13 of the captured cells
were fluorescent. Selected positives were checked by separating the mutant GCN4-bZIP
DBD expressor plasmids and re-transforming them together with cognate and non-cognate
reporter plasmids. None of the selected positives gave a fluorescent signal when
combined with non-cognate reporter plasmids, but all were fluorescent when combined
with the ATF/CREB cognate reporter plasmid (which did not produce any fluorescence
when transformed on its own).
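
The sort statistics reported above reduce to two simple ratios. The sketch below uses only the three counts given in the text (219 cells in the sorted gate, 132 captured, 13 fluorescent after outgrowth); the variable names are hypothetical.

```python
# Counts as reported in the text.
cells_in_gate = 219      # cells falling in the top 1% fluorescence gate
cells_captured = 132     # cells deposited in 96-well plates
cells_fluorescent = 13   # captured clones that were fluorescent after outgrowth

capture_efficiency = cells_captured / cells_in_gate
positive_fraction = cells_fluorescent / cells_captured

if __name__ == "__main__":
    print(f"capture efficiency: {capture_efficiency:.1%}")
    print(f"fluorescent among captured: {positive_fraction:.1%}")
```

Roughly 60% of gated cells were recovered in wells, and about one in ten recovered clones was fluorescent, which agrees with the post-selection frequency of about 1/10 reported for plated clones.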

This indicates that GFP selection indeed avoids the isolation of false positives.
Furthermore, when the library was checked prior to FACS sorting, no fluorescent clones
were identified when plating >10' cells. 1/10 clones plated post selection were
fluorescent, suggesting a selection factor in a single round in excess of 10^-fold.

All publications mentioned in the above specification are herein incorporated by
reference. All database sequences denoted by accession or gi numbers are likewise
incorporated by reference.

Various modifications and variations of the described methods and system of the
invention will be apparent to those skilled in the art without departing from the scope and
spirit of the invention. Although the invention has been described in connection with
specific preferred embodiments, it should be understood that the invention as claimed
should not be unduly limited to such specific embodiments. Indeed, various modifications
of the described modes for carrying out the invention which are obvious to those skilled in
molecular biology or related fields are intended to be within the scope of the following
claims.
Claims

1. A method for detecting a protein-nucleic acid interaction between a nucleic acid
molecule and a protein molecule, comprising the steps of:

a) providing one or more hybrid σ54 activator proteins comprising a heterologous
nucleic acid binding sequence and a constitutively active σ54 transcription activating
domain;

b) providing one or more nucleic acid molecules comprising a binding site for the
nucleic acid binding sequence and a binding site for σ54 RNAP, which directs the
expression of a reporter gene and leads to upregulation thereof in response to activation by
the σ54 transcription activating domain; and

c) detecting expression of the reporter gene.

2. A method according to claim 1, comprising providing a repertoire of hybrid σ54
activator proteins, said repertoire comprising a plurality of different nucleic acid binding
sequences.

3. A method according to claim 1, comprising providing a repertoire of hybrid nucleic
acid molecules, said repertoire comprising a plurality of different binding sites for the
nucleic acid binding sequence.

4. A method according to claim 1, comprising providing both a repertoire according
to claim 2 and a repertoire according to claim 3.

5. A method for detecting a protein-protein interaction, comprising the steps of:

a) providing a first hybrid protein comprising a nucleic acid binding sequence and a
first polypeptide sequence bait;

b) providing a second hybrid protein comprising a prey polypeptide sequence and a
constitutively active σ54 transcription activating domain;

c) providing a nucleic acid molecule comprising a binding site for the nucleic acid
binding sequence and a binding site for σ54 RNAP which directs the expression of a
reporter gene and leads to upregulation thereof in response to activation by the σ54
transcription activating domain;

d) incubating the first and second hybrid proteins together with the nucleic acid
molecule such that the prey and bait polypeptide sequences may bind, thereby forming a
hybrid protein comprising both a nucleic acid binding sequence and a σ54 transcription
activating domain; and

e) detecting expression of the reporter gene.

6. A method according to claim 5, comprising providing a repertoire of first hybrid
proteins, said repertoire comprising a plurality of bait polypeptides.

7. A method according to claim 5, comprising providing a repertoire of second hybrid
proteins, said repertoire comprising a plurality of prey polypeptides.

8. A method according to claim 5, comprising providing a repertoire of first hybrid
proteins and a repertoire of second hybrid proteins, said repertoires comprising a plurality
of bait and prey polypeptides.

9. A method for screening a repertoire of candidate DNA-bending
polypeptides, comprising the steps of:

a) providing a repertoire of candidate polypeptide factors with potential to induce
bending of DNA;

b) providing a σ54 activator protein comprising a nucleic acid binding sequence
and a σ54 transcription activating domain;

c) providing a nucleic acid molecule comprising a binding site for the nucleic acid
binding sequence and a binding site for σ54 RNAP which directs the expression of a
reporter gene and leads to upregulation thereof in response to activation by the σ54
transcription activating domain;

d) incubating the repertoire and σ54 activator together with the nucleic acid
molecule in a HIF' host cell, such that the σ54 activator and the nucleic acid molecule may
interact, and transcription is activated from the σ54 RNAP binding site in a manner
dependent on DNA bending by the polypeptide factors; and

e) detecting expression of the reporter gene.

10. A method according to any preceding claim, wherein the polypeptides are obtained
by expression within a bacterial host cell.

11. A method according to claim 10, wherein the polypeptides are encoded by one or
more libraries of nucleic acid vectors.

12. A method according to claim 11, wherein a first library of nucleic acid vectors
encodes a first chimeric gene, said gene comprising a nucleic acid sequence that encodes a
nucleic acid-binding domain and a nucleic acid sequence encoding a first (bait) test protein
or protein fragment in such a manner that the first test protein is expressed as part of a
hybrid protein with the nucleic acid-binding domain.

13. A method according to claim 11, wherein a second library of nucleic acid vectors
encodes a second chimeric gene, said gene comprising a nucleic acid sequence that
encodes a σ54 transcriptional activation domain and a second (prey) test protein or protein
fragment into the vector, in such a manner that the second test protein is capable of being
expressed as part of a hybrid protein with the transcriptional activation domain.

14. A method according to any preceding claim, wherein the σ54 transcriptional
activator is selected from the group consisting of:

dbj|BAA16379.1| (D90877) FORMATE HYDROGENLYASE TRANSCRIPTIONAL ACTIVATOR;
emb|CAA26472.1| (X02616) pot. Nifa gene product (aa 1-484) [Klebsiella pneumoniae];
emb|CAA53584.1| (X75972) anfa [Rhodobacter capsulatus];
emb|CAA92413.1| (Z68203) nifa homologue [Rhizobium sp.];
emb|CAA93242.1| (Z69251) mopr [Acinetobacter calcoaceticus];
emb|CAB53157.1| (X07567) nifa1 [Rhodobacter capsulatus];
emb|CAB56537.1| (AJ249642) response regulator [Pseudomonas stutzeri];
gb|AAA58220.1| (U18997) ORF_o532 [Escherichia coli];
gb|AAA99303.1| (L43064) regulatory protein [Pseudomonas aeruginosa];
gb|AAB91397.1| (AF033203) nifaii protein [Rhodobacter capsulatus];
gb|AAC05586.1| (AF006075) regulatory protein [Bacillus subtilis];
gb|AAC37124.1| (L81176) fleq [Pseudomonas aeruginosa];
gb|AAC45640.1| (AF010585) putative sigma 54 activator [Caulobacter crescentus];
gb|AAC46367.1| (AF014113) two-component response regulator [Vibrio cholerae];
gb|AAD34591.1|AF145956_1 (AF145956) transcriptional activator nifa [Rhodospirillum
rubrum];
gb|AAD38416.1| (AF155934) nifa [Alcaligenes faecalis];
gb|AAF28395.1| (AF069392) flam [Vibrio parahaemolyticus];
gb|AAF33506.1| (AF170176) Salmonella typhimurium transcriptional regulatory protein;
gb|AAF61932.1| (AF230804) sigma-54 activator protein Act1 [Myxococcus xanthus];
gb|AAF85342.1|AE004061_7 (AE004061) two-component system, regulatory protein
[Xylella fastidiosa];
gb|AAF94676.1| (AE004230) sigma-54 dependent response regulator [Vibrio cholerae];
gb|AAF95280.1| (AE004286) sigma-54 dependent response regulator [Vibrio cholerae];
gb|AAF96095.1| (AE004358) sigma-54 dependent transcriptional regulator [Vibrio
cholerae];
gb|AAG01527.1|AF288483_1 (AF288483) nifa [Azospirillum brasilense];
pir||A48291 ornithine decarboxylase inhibitor - Escherichia coli;
pir||B49940 nitrogen regulator I homolog - Escherichia coli;
pir||C70320 transcription regulator nifa family - Aquifex aeolicus;
pir||C70396 transcription regulator ntrc family - Aquifex aeolicus;
pir||C70454 transcription regulator ntrc family - Aquifex aeolicus;
pir||D70315 transcription regulator ntrc family - Aquifex aeolicus;
pir||H69581 transcription activator of acetoin dehydrogenase operon acor - Bacillus
subtilis;
pir||I39719 nitrogen regulatory protein - Agrobacterium tumefaciens;
pir||JC5471 regulatory protein nifa - Azospirillum lipoferum;
pir||T08624 probable ntrc-type response regulator - Eubacterium acidaminophilum;
sp|P03027|NIFA_KLEPN NIF-SPECIFIC REGULATORY PROTEIN;
sp|P09570|NIFA_AZOVI NIF-SPECIFIC REGULATORY PROTEIN;
sp|P12627|VNFA_AZOVI NITROGEN FIXATION PROTEIN VNFA;
sp|P14375|HYDG_ECOLI TRANSCRIPTIONAL REGULATORY PROTEIN HYDG;
sp|P21712|YFHA_ECOLI HYPOTHETICAL 49.1 KD PROTEIN IN GLNB-PURL
INTERGENIC REGION (ORFXB);
sp|P24426|NIFA_RHILT NIF-SPECIFIC REGULATORY PROTEIN;
sp|P25852|HYDG_SALTY TRANSCRIPTIONAL REGULATORY PROTEIN HYDG;
sp|P27713|NIFA_HERSE NIF-SPECIFIC REGULATORY PROTEIN;
sp|P30667|NIFA_AZOBR NIF-SPECIFIC REGULATORY PROTEIN;
sp|P38035|RTCR_ECOLI TRANSCRIPTIONAL REGULATORY PROTEIN RTCR;
sp|P54929|NIFA_AZOLI NIF-SPECIFIC REGULATORY PROTEIN;
sp|P56266|NIFA_KLEOX NIF-SPECIFIC REGULATORY PROTEIN;
sp|Q06065|ATOC_ECOLI ACETOACETATE METABOLISM REGULATORY
PROTEIN ATOC (ORNITHINE/ARGININE;
sp|Q46802|YGEV_ECOLI HYPOTHETICAL SIGMA-54-DEPENDENT
TRANSCRIPTIONAL REGULATOR IN;
sp|Q53206|NIFA_RHISN NIF-SPECIFIC REGULATORY PROTEIN; and
sp|Q9ZIB7|TYRR_ERWHE TRANSCRIPTIONAL REGULATORY PROTEIN TYRR.

15. A method according to any one of claims 1 to 14, wherein the σ54 transcriptional
activator is the NifA transcriptional activator or the PspF transcriptional activator.

16. A method according to any one of claims 1 to 14, wherein the hybrid σ54
transcriptional activator is NifA and activation resulting from NifA-σ54 RNAP interaction
is enhanced by the coexpression of wild-type or mutant NifA.

17. A method according to claim 16, wherein the hybrid σ54 transcriptional activator is
NifA from Azotobacter vinelandii, and the wild-type or mutant NifA is NifA from
Klebsiella pneumoniae.

18. A method according to any preceding claim, wherein the nucleic acid molecule
comprises a binding site for a factor which induces DNA bending.

19. A method according to claim 18, wherein the factor is integration host factor (IHF).

20. A method according to any one of claims 1 to 17, wherein the nucleic acid
molecule comprises DNA that is intrinsically bent.

21. A method according to any preceding claim, wherein the nucleic acid molecule
comprises a nifH promoter from A. vinelandii driving a reporter gene.

22. A method according to any preceding claim, wherein the reporter gene is selected
from the group consisting of metabolic markers such as the lac operon (lacZ, lacY and
lacA); proteins conferring a fluorescent phenotype, such as GFP; proteins conferring
antibiotic resistance, such as Zeo; and proteins conferring another selectable property.

23. A method according to any preceding claim, which is carried out in the presence of
a compound which modifies protein-protein or protein-DNA interaction.

24. A method according to claim 22, wherein the compound is selected from the group
consisting of molecules which alter the structure of the DNA-binding protein; molecules
which alter the structure of DNA; and molecules which modify protein-protein
interactions.

25. A method according to any preceding claim, which is carried out in vivo.

26. A method according to claim 25, wherein the in vivo host is E. coli.


27. A method according to any one of claims 1 to 24, which is carried out in vitro.